Abstract 



In previous work, the author introduced a measure-conjugacy invariant for sofic
group actions called sofic entropy. Here it is proven that the sofic entropy of an
amenable group action equals its classical entropy. The proof uses a new measure-conjugacy
invariant called upper-sofic entropy and a theorem of Rudolph and Weiss
for the entropy of orbit-equivalent actions relative to the orbit change $\sigma$-algebra.

1 Introduction

The paper [Bo10a] introduced a family of measure-conjugacy invariants referred to as sofic
entropy for actions of sofic groups. This entropy is inspired by the classical Kolmogorov-Sinai
entropy and shares many of its features. The main goal of this paper is to show that
the sofic entropy of an amenable group action equals its classical entropy. An alternative
approach based on operator algebras is being developed by Kerr and Li [KL1, KL2]. The
reader is encouraged to review [Bo10a] for more background.



1.1 Sofic groups 

To begin, let us recall the definition of a sofic group. 

Definition 1 (Sofic groups). Let $G$ be a countable group. For any integer $m_i > 0$, let
$[m_i] = \{1, \ldots, m_i\}$ and let $\mathrm{Sym}(m_i)$ denote the symmetric group on $[m_i]$. Let
$\Sigma = \{\sigma_i\}_{i=1}^\infty$ be a sequence of maps $\sigma_i : G \to \mathrm{Sym}(m_i)$ which are not
assumed to be homomorphisms. Then $\Sigma$ is a sofic approximation to $G$ if for every $g, h \in G$,

$$\lim_{i\to\infty} \frac{1}{m_i} \#\{p \in [m_i] : \sigma_i(g)\sigma_i(h)p = \sigma_i(gh)p\} = 1$$

and for every $g \ne h \in G$,

$$\lim_{i\to\infty} \frac{1}{m_i} \#\{p \in [m_i] : \sigma_i(g)p \ne \sigma_i(h)p\} = 1.$$

To avoid trivialities, we also assume $\lim_{i\to\infty} m_i = +\infty$, which is necessarily true if $G$ is
infinite. $G$ is sofic if there exists a sofic approximation to $G$.

Example 1. If $G$ is residually finite then there exists a decreasing sequence $\{N_i\}_{i=1}^\infty$ of
finite-index normal subgroups of $G$ with $\bigcap_i N_i = \{e\}$. Let $\sigma_i : G \to \mathrm{Sym}(G/N_i)$ be the canonical
homomorphism given by the action of $G$ on $G/N_i$. Then $\{\sigma_i\}_{i=1}^\infty$ is a sofic approximation to
$G$.

Example 2. If $G$ is amenable then there exists an increasing sequence $\{F_i\}_{i=1}^\infty$ of finite subsets
of $G$ such that $\bigcup_i F_i = G$ and for every finite $K \subset G$

$$\lim_{i\to\infty} \frac{|KF_i \cap F_i|}{|F_i|} = 1.$$

Let $\sigma_i : G \to \mathrm{Sym}(F_i)$ be any map such that if $f \in F_i$, $g \in G$ and $gf \in F_i$ then $\sigma_i(g)f = gf$.
Then $\{\sigma_i\}_{i=1}^\infty$ is a sofic approximation to $G$.
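
As a concrete illustration of Example 2 (a sketch only, not part of the original argument), take $G = \mathbb{Z}$ and the Følner sets $F_n = \{0, \ldots, n-1\}$. The helper names `folner_map`, `multiplicative_fraction` and `separation_fraction` are hypothetical, and the way the map is completed to a bijection (reversed order) is an arbitrary choice made so that the resulting map is generally not a homomorphism. The two fractions computed are the quantities appearing in the definition of a sofic approximation.

```python
def folner_map(n, g):
    """A permutation sigma_n(g) of F_n = {0, ..., n-1} such that
    sigma_n(g) f = f + g whenever f + g lies in F_n."""
    perm = [None] * n
    used = set()
    for f in range(n):
        t = f + g
        if 0 <= t < n:          # forced value: the translate stays inside F_n
            perm[f] = t
            used.add(t)
    free_sources = [f for f in range(n) if perm[f] is None]
    free_targets = [t for t in range(n) if t not in used]
    # Arbitrary completion to a bijection (reversed order, an assumption).
    for f, t in zip(free_sources, reversed(free_targets)):
        perm[f] = t
    return perm


def multiplicative_fraction(n, g, h):
    """Fraction of p with sigma(g) sigma(h) p = sigma(g + h) p."""
    sg, sh, sgh = folner_map(n, g), folner_map(n, h), folner_map(n, g + h)
    return sum(sg[sh[p]] == sgh[p] for p in range(n)) / n


def separation_fraction(n, g, h):
    """Fraction of p with sigma(g) p != sigma(h) p."""
    sg, sh = folner_map(n, g), folner_map(n, h)
    return sum(sg[p] != sh[p] for p in range(n)) / n


if __name__ == "__main__":
    for n in (100, 1000, 10000):
        print(n, multiplicative_fraction(n, 3, -5), separation_fraction(n, 3, -5))
```

Both fractions tend to 1 as $n$ grows because the number of points $p$ whose translates leave $F_n$ is bounded independently of $n$.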

Sofic groups were defined implicitly by Gromov in [Gr99] and explicitly by Weiss in
[We00]. Since finitely generated linear groups (i.e., finitely generated subgroups of $GL_n(F)$ where $F$ is a field)
are residually finite (by [Ma40]) they are sofic. It is easy to check that a countable group is
sofic if and only if all of its finitely generated subgroups are sofic. Thus all countable linear
groups are sofic. It is unknown whether every countable group is sofic; one unresolved
case is that of the universal Burnside group on a finite set of generators. Pestov has written
a beautiful up-to-date survey [Pe08] on sofic groups and their siblings, hyperlinear groups.

1.2 Sofic entropy

Let $G$ be a countable discrete group. In this paper, an action of $G$ is a triple $(T, X, \mu)$
where $(X, \mu)$ is a standard probability space and $T = (T_g)_{g \in G}$ is a collection of measure-preserving
transformations $T_g : X \to X$ such that $T_g T_h = T_{gh}$ for all $g, h \in G$. The notation
$G \curvearrowright^T (X, \mu)$ means $(T, X, \mu)$ is an action. Also $G \curvearrowright (X, \mu)$ means that $G$ acts on $(X, \mu)$ by
measure-preserving transformations and the product of $g \in G$ with $x \in X$ is denoted $gx$.

A process over $G$ is a quadruple $\mathbf{X} := (T, X, \mu, \phi)$ where $(T, X, \mu)$ is a $G$-action and
$\phi : X \to A$ is a measurable map into a finite or countable set $A$. $\phi$ is called an observable
and $A$ is the range of the process. We will implicitly assume that the range of every process
considered in this paper is finite. The next few paragraphs define the entropy rate of $\phi$ with
respect to a sofic approximation $\Sigma$ for $G$ in the special case in which $A$ is finite.

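Suppose that $\sigma : G \to \mathrm{Sym}(m)$ is a map and $\psi : \{1, \ldots, m\} \to A$ is a function. In order
to compare $\psi$ with $\phi$, let $W \subset G$ be finite ($W$ is for window). Recall that $A^W$ is the set of
all functions from $W$ to $A$. Let $\phi^W : X \to A^W$ be the map defined by $\phi^W(x)(w) := \phi(T_w x)$.
Similarly define $\psi^W_\sigma : \{1, \ldots, m\} \to A^W$ by $\psi^W_\sigma(p)(w) := \psi(\sigma(w)p)$. The measure $\mu$ pushes
forward to a measure $\phi^W_*\mu$ on $A^W$. Similarly, if $u$ is the uniform probability measure on
$\{1, \ldots, m\}$, then $(\psi^W_\sigma)_* u$ is a measure on $A^W$. Let $d_W((\sigma, \psi), \phi)$ be the total variation distance
between $\phi^W_*\mu$ and $(\psi^W_\sigma)_* u$. Explicitly,

$$d_W((\sigma, \psi), \phi) = \frac{1}{2}\left\|\phi^W_*\mu - (\psi^W_\sigma)_* u\right\|_1 = \frac{1}{2}\sum_{a \in A^W} \left|\phi^W_*\mu(a) - (\psi^W_\sigma)_* u(a)\right|.$$

The sofic entropy rate of the process $\mathbf{X}$ with respect to a sofic approximation $\Sigma = \{\sigma_i\}_{i=1}^\infty$
to $G$ (where $\sigma_i : G \to \mathrm{Sym}(m_i)$) is defined by:

$$h(\Sigma, \mathbf{X}) := \inf_{W \subset G}\, \inf_{\epsilon > 0}\, \limsup_{i \to \infty} \frac{\log \#\{\psi : \{1, \ldots, m_i\} \to A : d_W((\sigma_i, \psi), \phi) < \epsilon\}}{m_i}.$$

The first infimum is over all finite subsets of $G$. The entropy $h(\Sigma, \mathbf{X})$ may alternatively be
denoted by $h(\Sigma, \phi)$ or $h_\mu(\Sigma, \phi)$.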
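
The distance $d_W$ can be computed directly for small examples. The following is a minimal sketch under stated assumptions: points are indexed $0, \ldots, m-1$ rather than $1, \ldots, m$, the map $\sigma$ is passed as a function `sigma(w, p)`, total variation is taken to be half the $\ell^1$-distance, and the names `local_statistics` and `total_variation` and the toy data are hypothetical.

```python
from collections import Counter
from fractions import Fraction


def local_statistics(sigma, psi, window, m):
    """Pushforward of the uniform measure on {0, ..., m-1} under
    p -> (psi(sigma(w) p))_{w in window}, returned as a dict."""
    counts = Counter(tuple(psi[sigma(w, p)] for w in window) for p in range(m))
    return {pattern: Fraction(c, m) for pattern, c in counts.items()}


def total_variation(mu, nu):
    """Half the l1-distance between two finitely supported measures (dicts)."""
    keys = set(mu) | set(nu)
    return sum(abs(mu.get(k, 0) - nu.get(k, 0)) for k in keys) / 2


# Toy data (assumed for illustration): G = Z, sigma(w) = rotation by w on Z/m.
m = 10
rotation = lambda w, p: (p + w) % m
psi = [p % 2 for p in range(m)]          # alternating colouring 0, 1, 0, 1, ...
window = (0, 1)
stats = local_statistics(rotation, psi, window, m)

# Target local statistics: two independent fair coins on the window.
fair_coin = {(a, b): Fraction(1, 4) for a in (0, 1) for b in (0, 1)}
distance = total_variation(stats, fair_coin)
```

Here the alternating colouring only produces the patterns `(0, 1)` and `(1, 0)`, so it is far from the fair-coin statistics and would not be counted in the set defining the entropy rate for small $\epsilon$.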

In order to obtain a measure-conjugacy invariant, consider a special class of observables
as follows. The map $\phi$ is generating if the smallest $G$-invariant $\sigma$-algebra on $X$ for which $\phi$ is
measurable is the $\sigma$-algebra of all measurable sets up to sets of measure zero. The following
is part of the main result of [Bo10a].

Theorem 1.1. Suppose $G \curvearrowright (X, \mu)$. If $\phi_1$ and $\phi_2$ are finite generating observables of $X$ and
$\Sigma$ is a sofic approximation to $G$ then $h(\Sigma, \phi_1) = h(\Sigma, \phi_2)$.

Because of this result, the entropy of the action $G \curvearrowright^T (X, \mu)$ with respect to $\Sigma$ is defined
by $h(\Sigma, T) := h(\Sigma, \phi)$ where $\phi$ is any finite generating observable (if one exists).

In [Bo10a] an alternative but equivalent definition of entropy is given based on partitions
instead of observables. Also the entropy rate of an observable with countable range is defined
under special conditions. That extension is not needed here. The main result of this paper
is:

Theorem 1.2. If $G$ is infinite and amenable, $G \curvearrowright (X, \mu)$ and $\phi$ is a finite observable then
for any sofic approximation $\Sigma$ to $G$, $h(\Sigma, \phi)$ is the classical mean entropy rate of $\phi$.

The definition of classical mean entropy rate is reviewed in §3. By [Ro88], if the classical
entropy of an ergodic, essentially free system $G \curvearrowright^T (X, \mu)$ is finite (and $G$ is amenable) then
there exists a finite generating observable for the action. So the above theorem implies
$h(\Sigma, T)$ is the classical entropy of the action in this case.

1.3 Overview 

§2 discusses several variations on the definition of sofic entropy. These arise from allowing
randomness in the sofic approximation and also in the approximations to the observable. §3
provides basic background on amenable groups and classical entropy theory. §4 proves the
main Theorem 1.2 in the special case in which $G = \mathbb{Z}$. This uses the above variations on sofic
entropy but is otherwise elementary. §5 discusses relative entropy theory, in both the classical
case and the sofic case. §6 proves the main results in entropy/orbit-equivalence theory that
allow us to conclude Theorem 1.2.

Acknowledgements. I'd like to thank Gabor Elek for providing a rough outline of 
a proof of the main theorem based on quasi-tiling machinery. The proof presented here 
does not use his outline but it helped get me started. I'd also like to thank David Kerr for 
encouragement and especially Hanfeng Li for finding many errors in previous versions. 

2 Random sofic approximations, approximate processes 
and entropy 

It will be helpful to broaden the notion of sofic approximation to allow for 'random' sofic 
approximations, defined next. 

Definition 2 (Random sofic approximation). Let $G$ be a countable group. Let $\{m_i\}_{i=1}^\infty$ be
a sequence of natural numbers, $\mathrm{Sym}(m_i)^G$ the set of maps from $G$ to $\mathrm{Sym}(m_i)$ and $u_{m_i}$ the
uniform probability measure on $[m_i]$. Let $\kappa_i$ be a probability measure on $\mathrm{Sym}(m_i)^G$. We say
that the sequence $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ is a random sofic approximation to $G$ if for every $g, h \in G$,

$$\lim_{i \to \infty} \kappa_i \times u_{m_i}\left(\{(\sigma, p) \in \mathrm{Sym}(m_i)^G \times [m_i] : \sigma(g)\sigma(h)p = \sigma(gh)p\}\right) = 1$$

and for every $g \ne h \in G$,

$$\lim_{i \to \infty} \kappa_i \times u_{m_i}\left(\{(\sigma, p) \in \mathrm{Sym}(m_i)^G \times [m_i] : \sigma(g)p \ne \sigma(h)p\}\right) = 1.$$

Remark 1. This notion generalizes sofic approximations in the following sense. If $\Sigma = \{\sigma_i\}_{i=1}^\infty$
is a sofic approximation of a group $G$ (where $\sigma_i \in \mathrm{Sym}(m_i)^G$) and $\delta_i$ is the probability measure
on $\mathrm{Sym}(m_i)^G$ supported on $\sigma_i$ then $\{\delta_i\}_{i=1}^\infty$ is a random sofic approximation of $G$.
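
Definition 3 (Sofic entropy). Let $\mathbf{X} := (T, X, \mu, \phi)$ be a process over a group $G$ with
random sofic approximation $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$. Suppose $\phi : X \to A$ where $A$ is a finite set. For
each $\sigma \in \mathrm{Sym}(m_i)^G$, finite $W \subset G$ and $\epsilon > 0$, let $N_i(\sigma, W, \epsilon)$ be the number of functions
$\psi : [m_i] \to A$ such that $d_W((\sigma, \psi), \phi) < \epsilon$ (this is the notation used in §1.2). The sofic
entropy of $\mathbf{X}$ with respect to $\mathbb{K}$ is:

$$h(\mathbb{K}, \mathbf{X}) := \inf_{W \subset G}\, \inf_{\epsilon > 0}\, \limsup_{i \to \infty} \frac{1}{m_i}\int \log N_i(\sigma, W, \epsilon)\, d\kappa_i(\sigma).$$

For example, if $\mathbb{K}'$ is any subsequence of $\mathbb{K}$ then $h(\mathbb{K}', \mathbf{X}) \le h(\mathbb{K}, \mathbf{X})$.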

The definition above generalizes the notion of sofic entropy by introducing randomness
into the sofic approximation. It is also possible to introduce randomness into the observables.
This leads to a new notion of entropy called upper-sofic entropy (which was introduced
implicitly in [Bo10b]). To explain, it is necessary to have a notion of "approximate process"
which is motivated by the definition of a sofic group.

Definition 4 (Approximate process). An approximate process over $G$ is a quadruple $\mathbf{X} =
(T, X, \mu, \phi)$ where $(X, \mu)$ is a standard probability space, $T = (T_g)_{g \in G}$ is a set of measure-preserving
Borel maps $T_g : X \to X$ and $\phi : X \to A$ is a Borel map to a finite or countable
set $A$ called the range of the process. An approximate process is a process if $T$ defines an
action: i.e., $T_{gh} = T_g T_h$ for all $g, h \in G$. The word 'approximate' is used to suggest that $\mathbf{X}$ is
approximating some process. The definition by itself does not imply this but it is how these
objects will be used.

Definition 5 (Local statistics and distance between processes). Given a finite set $W \subset G$
and an approximate process $\mathbf{X} = (T, X, \mu, \phi)$ define $\phi^W_{\mathbf{X}} : X \to A^W$ by

$$\phi^W_{\mathbf{X}}(x) := [w \mapsto \phi(T_w x)].$$

Let $(\phi^W_{\mathbf{X}})_*\mu$ be the pushforward measure on $A^W$. This measure is called the $W$-local statistics
of $\mathbf{X}$.

Given another approximate process $\mathbf{Y} = (S, Y, \nu, \psi)$ with range $A$ we define

$$d_W(\mathbf{X}, \mathbf{Y}) := \frac{1}{2}\left\|(\phi^W_{\mathbf{X}})_*\mu - (\psi^W_{\mathbf{Y}})_*\nu\right\|_1.$$

$\mathbf{X}$ and $\mathbf{Y}$ are said to be equivalent if $d_W(\mathbf{X}, \mathbf{Y}) = 0$ for all finite $W \subset G$. Let $\mathcal{P}(G, A)$ be
the set of all equivalence classes of approximate processes over $G$ with range $A$. By abuse of
notation, we do not distinguish between an approximate process and its equivalence class.
Let $\mathcal{P}(G, A)$ have the smallest topology such that for every finite $W \subset G$ the function $d_W$ is
continuous with respect to the product topology on $\mathcal{P}(G, A) \times \mathcal{P}(G, A)$.

Definition 6 (Approximate processes from random sofic approximations). Let $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$
be a random sofic approximation to $G$. Let $\eta_i$ be a probability measure on $\mathrm{Sym}(m_i)^G \times A^{[m_i]}$
where $A$ is a finite or countable set. Suppose that the projection of $\eta_i$ to the first factor is
$\kappa_i$. For each $g \in G$ define

$$\tilde{\sigma}_i(g) : \mathrm{Sym}(m_i)^G \times A^{[m_i]} \times [m_i] \to \mathrm{Sym}(m_i)^G \times A^{[m_i]} \times [m_i], \quad \tilde{\sigma}_i(g)(\sigma, \psi, p) = (\sigma, \psi, \sigma(g)p).$$

Define

$$\chi_i : \mathrm{Sym}(m_i)^G \times A^{[m_i]} \times [m_i] \to A, \quad \chi_i(\sigma, \psi, p) := \psi(p).$$

Define

$$\mathbf{X}_i := \left(\tilde{\sigma}_i,\ \mathrm{Sym}(m_i)^G \times A^{[m_i]} \times [m_i],\ \eta_i \times u_{m_i},\ \chi_i\right).$$

Then $\mathbf{X}_i$ is the approximate process constructed from $\eta_i$.

The sequence $\{\mathbf{X}_i\}_{i=1}^\infty$ is adapted to the random sofic approximation $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ if it arises from
the above construction for some sequence of measures $\{\eta_i\}_{i=1}^\infty$.

Definition 7. If $\mu$ is a probability measure on a finite or countable set $X$, then

$$H(\mu) := -\sum_{x \in X} \mu(\{x\}) \log(\mu(\{x\})).$$

By convention $0 \log(0) = 0$.

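Definition 8 (Upper-sofic entropy). Let $\eta_i$, $\kappa_i$, etc. be as in definition 6 and let $\eta_i =
\int \nu_{i,\sigma}\, d\kappa_i(\sigma)$ be the decomposition of $\eta_i$ over $\kappa_i$. So $\nu_{i,\sigma}$ is a probability measure on the set
$\{(\sigma, \psi) : \psi \in A^{[m_i]}\} \subset \mathrm{Sym}(m_i)^G \times A^{[m_i]}$. Then define

$$h(\mathbf{X}_i) := \frac{1}{m_i}\int H(\nu_{i,\sigma})\, d\kappa_i(\sigma).$$

This definition depends implicitly on $\eta_i$ (which might not be determined by the equivalence
class of $\mathbf{X}_i$).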

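The upper-sofic entropy of a finite-range process $\mathbf{X} = (T, X, \mu, \phi)$ with respect to $\mathbb{K}$ is
defined by

$$\overline{h}(\mathbb{K}, \mathbf{X}) := \sup \limsup_{j \to \infty} h(\mathbf{X}_j)$$

where the supremum is over all sequences $\{\mathbf{X}_j\}_{j=1}^\infty$ of approximate processes adapted to
$\mathbb{K}'$ (where $\mathbb{K}'$ is a subsequence of $\mathbb{K}$) such that $\lim_{j \to \infty} \mathbf{X}_j = \mathbf{X}$. If no such sequence exists then
$\overline{h}(\mathbb{K}, \mathbf{X}) = -\infty$. The upper-sofic entropy $\overline{h}(\mathbb{K}, \mathbf{X})$ can alternatively be denoted by $\overline{h}(\mathbb{K}, \phi)$
or $\overline{h}_\mu(\mathbb{K}, \phi)$ if it is desirable to emphasize the dependence on $\mu$ or $\phi$.

If each $\kappa_i$ is supported on a singleton set $\{\sigma_i\} \subset \mathrm{Sym}(m_i)^G$ then let $\Sigma := \{\sigma_i\}_{i \in \mathbb{N}}$ and
define $\overline{h}(\Sigma, \mathbf{X}) := \overline{h}(\mathbb{K}, \mathbf{X})$.

Using the methods of [Bo10a], it can be shown that upper-sofic entropy is an invariant:
if $\phi, \psi$ are two generating observables with finite range then $\overline{h}(\mathbb{K}, \phi) = \overline{h}(\mathbb{K}, \psi)$ (but this is
not needed here). Next upper-sofic entropy is related to sofic entropy (which will be referred
to as lower-sofic entropy, and written $\underline{h}$, so as to distinguish it).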






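Definition 9 (Strong convergence of approximate processes). As above, let $\{\mathbf{X}_i\}_{i=1}^\infty$ be a
sequence of approximate processes constructed from measures $\{\eta_i\}_{i=1}^\infty$ on $\mathrm{Sym}(m_i)^G \times A^{[m_i]}$
as in definition 6. Suppose that the limit $\lim_{i \to \infty} \mathbf{X}_i = \mathbf{X}$ is a process over $G$. The sequence
$\{\mathbf{X}_i\}_{i=1}^\infty$ converges to $\mathbf{X}$ strongly (denoted $\lim_{i \to \infty} \mathbf{X}_i = \mathbf{X}$ strongly) if for every finite $W \subset G$
and every $\epsilon > 0$,

$$\lim_{i \to \infty} \eta_i\left(\{(\sigma, \psi) \in \mathrm{Sym}(m_i)^G \times A^{[m_i]} : d_W((\sigma, \psi), \phi) < \epsilon\}\right) = 1.$$

Proposition 2.1. Let $\mathbf{X}$ be a process over $G$ and let $\mathbb{K}$ be a random sofic approximation.
Then

$$\underline{h}(\mathbb{K}, \mathbf{X}) = \sup \limsup_{j \to \infty} h(\mathbf{X}_j)$$

where the supremum is over all sequences $\{\mathbf{X}_j\}_{j=1}^\infty$ of approximate processes adapted to $\mathbb{K}'$
(where $\mathbb{K}'$ is a subsequence of $\mathbb{K}$) such that $\lim_{j \to \infty} \mathbf{X}_j = \mathbf{X}$ strongly.

Proof. The proof is an exercise in understanding the definitions. □

Corollary 2.2. Let $\mathbf{X}$ be a process over $G$ and let $\mathbb{K}$ be a random sofic approximation. Then

$$\underline{h}(\mathbb{K}, \mathbf{X}) \le \overline{h}(\mathbb{K}, \mathbf{X}).$$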



3 Amenable groups 

Definition 10. Let $G$ be a countable group, $F, K \subset G$ finite sets and $\epsilon > 0$. Then $F$ is
left-$(K, \epsilon)$-invariant if

$$\frac{|KF \cap F|}{|F|} > 1 - \epsilon$$

where $KF = \{kf \in G \mid k \in K, f \in F\}$. A left-Følner sequence of $G$ is a sequence $\{F_n\}_{n=1}^\infty$
of finite subsets of $G$ such that for all finite $K \subset G$ and all $\epsilon > 0$ there exists an $N$ such
that $n > N$ implies $F_n$ is left-$(K, \epsilon)$-invariant. $G$ is amenable if there exists a left-Følner
sequence for $G$.

Definition 11. Let $G$ be an amenable group with left-Følner sequence $\{F_n\}_{n=1}^\infty$. Let $\mathbf{X} =
(T, X, \mu, \phi)$ be a process over $G$ with range $A$. For a finite $W \subset G$, let $\phi^W : X \to A^W$ be
the map $\phi^W(x)(w) = \phi(T_w x)$. The classical entropy of $\mathbf{X}$ is defined by

$$h(\mathbf{X}) := \lim_{n \to \infty} \frac{H(\phi^{F_n}_*\mu)}{|F_n|}$$

where $H(\cdot)$ is as in definition 7. Some alternative notations for the entropy rate are: $h(\mathbf{X}) =
h(\phi) = h_\mu(\phi) = h_\mu(T, \phi) = h(T, \phi)$. The entropy rate does not depend on the choice of
Følner sequence (e.g., [Ol85]).
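
To make the definition concrete, here is a small sketch for $G = \mathbb{Z}$ with Følner sets $F_n = \{0, \ldots, n-1\}$, where the process is assumed (for illustration only) to be a stationary two-state Markov chain observed through its time-0 coordinate. The function names and the transition matrix are hypothetical. For such a chain the block entropy satisfies the standard identity $H_n = H(\pi) + (n-1)h$, so the averages $H_n/n$ decrease to the entropy rate $h$.

```python
import itertools
import math


def shannon_entropy(probs):
    """H = -sum p log p, with the convention 0 log 0 = 0."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def block_entropy(pi, P, n):
    """Entropy of the law of (x_0, ..., x_{n-1}) for a stationary Markov chain
    with stationary distribution pi and transition matrix P (brute force)."""
    states = range(len(pi))
    probs = []
    for word in itertools.product(states, repeat=n):
        p = pi[word[0]]
        for a, b in zip(word, word[1:]):
            p *= P[a][b]
        probs.append(p)
    return shannon_entropy(probs)


P = [[0.9, 0.1], [0.2, 0.8]]     # assumed transition matrix
pi = [2 / 3, 1 / 3]              # its stationary distribution
rate = sum(pi[a] * shannon_entropy(P[a]) for a in range(2))
averages = [block_entropy(pi, P, n) / n for n in range(1, 11)]
```

The list `averages` approximates the limit in the definition from above; the gap to `rate` after $n$ steps is $(H(\pi) - h)/n$.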

4 The case of Z



The purpose of this section is to prove: 

Proposition 4.1. Let $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ be a random sofic approximation of $\mathbb{Z}$. Let $\mathbf{X}$ be a process
over $\mathbb{Z}$ with finite range. Then $\underline{h}(\mathbb{K}, \mathbf{X}) = \overline{h}(\mathbb{K}, \mathbf{X}) = h(\mathbf{X})$. That is, classical entropy, sofic
entropy and upper-sofic entropy agree.

In order to prove this, we will reduce to the case when the sofic approximation $\mathbb{K}$ is
particularly simple (given by finite quotients of $\mathbb{Z}$). For this, we need to define what it
means for two sofic approximations to be close.

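Definition 12. Let $W \subset G$ be finite and let $\epsilon > 0$. Suppose that $\sigma : G \to \mathrm{Sym}(m)$ and
$\sigma' : G \to \mathrm{Sym}(m')$ are two maps and there exist subsets $Q \subset [m]$, $Q' \subset [m']$ and a bijection
$\beta : Q \to Q'$ such that

1. $\sigma'(w)\beta(q) = \beta(\sigma(w)q)$ for all $w \in W$ and $q \in Q$ with $\sigma(w)q \in Q$;

2. $\sigma(w)\beta^{-1}(q') = \beta^{-1}(\sigma'(w)q')$ for all $w \in W$ and $q' \in Q'$ with $\sigma'(w)q' \in Q'$;

3. $|Q| > (1 - \epsilon)m$, $|Q'| > (1 - \epsilon)m'$.

Then $\sigma$ and $\sigma'$ are said to be $(W, \epsilon)$-close to each other. We say two probability measures
$\kappa, \kappa'$ on $\mathrm{Sym}(m)^G$, $\mathrm{Sym}(m')^G$ respectively are $(W, \epsilon)$-close if there is a probability measure $J$
on $\mathrm{Sym}(m)^G \times \mathrm{Sym}(m')^G$ with marginals $\kappa$ and $\kappa'$ such that $J(\mathcal{G}(W, \epsilon)) > 1 - \epsilon$ where
$\mathcal{G}(W, \epsilon)$ is the set of all $(\sigma, \sigma') \in \mathrm{Sym}(m)^G \times \mathrm{Sym}(m')^G$ that are $(W, \epsilon)$-close to each other.
Finally, we say that two random sofic approximations to $G$, $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ and $\mathbb{L} = \{\lambda_i\}_{i=1}^\infty$,
are asymptotic if for every finite $W \subset G$ and $\epsilon > 0$, $\kappa_i$ is $(W, \epsilon)$-close to $\lambda_i$ for all sufficiently
large $i$.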

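The following theorem is a special case of Theorem 5.5 proven in the next section.

Theorem 4.2. Let $\mathbf{X}$ be a process over a group $G$ with random sofic approximations $\mathbb{K}$ and
$\mathbb{L}$. If $\mathbb{K}$ and $\mathbb{L}$ are asymptotic then $\underline{h}(\mathbb{K}, \mathbf{X}) = \underline{h}(\mathbb{L}, \mathbf{X})$ and $\overline{h}(\mathbb{K}, \mathbf{X}) = \overline{h}(\mathbb{L}, \mathbf{X})$.

This motivates the next result, which is a minor extension of [ES10, Theorem 2].

Theorem 4.3. Let $G$ be an amenable group and $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$, $\mathbb{L} = \{\lambda_i\}_{i=1}^\infty$ be random sofic
approximations to $G$. Suppose that for each $i$, $\kappa_i$ is a probability measure on $\mathrm{Sym}(m_i)^G$, $\lambda_i$ is a
probability measure on $\mathrm{Sym}(m'_i)^G$ and $\lim_{i \to \infty} \frac{m_i}{m'_i} = 1$. Then $\mathbb{K}$ and $\mathbb{L}$ are asymptotic.

To prove this, we need some terminology adapted from [ES10].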

Definition 13. Let $G$ be a finitely generated group and $S = S^{-1} \subset G$ a finite generating
set for $G$. Let $\sigma : G \to \mathrm{Sym}(m)$ be a map. Then $\sigma$ is an $r$-approximation to $(G, S)$ (for $r$ a
positive integer) if there exists a set $V \subset [m]$ such that $|V| > (1 - \frac{1}{r})m$ and for every $v \in V$
and every sequence $g_1, \ldots, g_r \in S \cup \{e\}$,

$$\sigma(g_1)\sigma(g_2) \cdots \sigma(g_r)v = \sigma(g_1 g_2 \cdots g_r)v$$

and if $g'_1, \ldots, g'_r \in S \cup \{e\}$ are such that $g_1 \cdots g_r \ne g'_1 \cdots g'_r$ then

$$\sigma(g_1 g_2 \cdots g_r)v \ne \sigma(g'_1 g'_2 \cdots g'_r)v.$$

Definition 14. If $T \subset G$ and $\sigma : G \to \mathrm{Sym}(T)$ is a map then $\sigma$ is a copy of $T$ if $\sigma(g)t = gt$
for every $t \in T$ and $g \in G$ with $gt \in T$.

Definition 15. If $\sigma_i : G \to \mathrm{Sym}(m_i)$ ($i = 1, \ldots, n$) are maps then $\sigma_1 + \cdots + \sigma_n$ is defined
to be the map from $G$ to $\mathrm{Sym}(m_1 + \cdots + m_n)$ defined by

$$(\sigma_1 + \cdots + \sigma_n)(g)p := \sigma_i(g)p$$

where $i$ is determined by: if $p \le m_1$ then $i = 1$; otherwise $i$ is such that $\sum_{j=1}^{i-1} m_j < p \le
\sum_{j=1}^{i} m_j$ (here the $i$-th block of $[m_1 + \cdots + m_n]$ is identified with $[m_i]$). Also if $r \ge 1$ is an
integer then $r\sigma_1 := \sigma_1 + \cdots + \sigma_1$ ($r$ summands). So if
$a = (a_1, \ldots, a_r)$ is a vector of positive integers and $(\sigma_1, \ldots, \sigma_r)$ is a vector of maps then
$a \cdot (\sigma_1, \ldots, \sigma_r)$ is defined to be the map $a_1\sigma_1 + \cdots + a_r\sigma_r$.
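
For a fixed group element $g$, the sum of maps is just the block-diagonal concatenation of the permutations $\sigma_i(g)$. A minimal sketch follows, assuming permutations are stored as 0-indexed Python lists; the names `direct_sum`, `multiple` and `weighted_sum` are hypothetical.

```python
def direct_sum(perms):
    """Concatenate permutations (lists on {0, ..., m_i - 1}) into a single
    permutation of the disjoint union, block by block."""
    result, offset = [], 0
    for perm in perms:
        result.extend(offset + q for q in perm)   # shift the i-th block
        offset += len(perm)
    return result


def multiple(r, perm):
    """r * sigma := sigma + ... + sigma (r summands)."""
    return direct_sum([perm] * r)


def weighted_sum(a, perms):
    """a . (sigma_1, ..., sigma_r) := a_1 sigma_1 + ... + a_r sigma_r."""
    return direct_sum([multiple(a_i, perm) for a_i, perm in zip(a, perms)])


if __name__ == "__main__":
    swap = [1, 0]          # a transposition on two points
    cycle = [2, 0, 1]      # a 3-cycle on three points
    print(weighted_sum((2, 1), [swap, cycle]))
```

Each block is acted on independently, which is why closeness statements for the summands pass to the sum.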

Proposition 4.4. Let $G$ be a finitely generated amenable group with finite generating set
$S = S^{-1}$. Let $\{F_i\}_{i=1}^\infty$ be a left-Følner sequence for $G$. Then for each integer $r > 0$ there exists
an integer $R(r) > 0$ (which also depends on $\{F_i\}_{i=1}^\infty$), a finite subsequence $\{T_1, \ldots, T_n\} \subset
\{F_i\}_{i=1}^\infty$ and a vector $\alpha = (\alpha_1, \ldots, \alpha_n)$ of positive natural numbers such that the following
holds. Every $T_i$ satisfies $|ST_i \,\Delta\, T_i| < |T_i|/r$ and every map $\sigma : G \to \mathrm{Sym}(m)$ which is an
$R(r)$-approximation to $(G, S)$ is $(S, \frac{1}{r})$-close to an integer multiple of $\alpha \cdot (\sigma_1, \ldots, \sigma_n)$ where
$\sigma_i$ is any copy of $T_i$.

Proof. This is Proposition 2.8 of [ES10] (in different terminology). □

Proof of Theorem 4.3. Let $W \subset G$ be finite and $\epsilon > 0$. It suffices to show that $\kappa_i$ is $(W, \epsilon)$-close
to $\lambda_i$ for all sufficiently large $i$. Let $S = W \cup W^{-1}$. Without loss of generality, we may
assume $G$ is generated by $S$.

Given integers $m, r > 0$, let $X(m, r)$ be the set of all $(\sigma, p) \in \mathrm{Sym}(m)^G \times [m]$ such that

$$\sigma(g_1)\sigma(g_2) \cdots \sigma(g_r)p = \sigma(g_1 \cdots g_r)p \quad \forall g_1, g_2, \ldots, g_r \in S \cup \{e\}$$

and if $g'_1, \ldots, g'_r \in S \cup \{e\}$ are such that $g_1 g_2 \cdots g_r \ne g'_1 g'_2 \cdots g'_r$ then

$$\sigma(g_1 g_2 \cdots g_r)p \ne \sigma(g'_1 g'_2 \cdots g'_r)p.$$

For $\sigma \in \mathrm{Sym}(m)^G$, let $X_\sigma(m, r)$ be the set of all $p \in [m]$ with $(\sigma, p) \in X(m, r)$.

Now let $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ be a random sofic approximation. By definition, there exists an $N$
such that for all $i > N$, $\kappa_i \times u_{m_i}(X(m_i, r)) \ge 1 - \frac{1}{r^2}$ where $u_{m_i}$ is the uniform probability
measure on $[m_i]$. Let $\sigma$ be an element of $\mathrm{Sym}(m_i)^G$ chosen at random with law $\kappa_i$. By
Markov's inequality,

$$P\left(m_i - |X_\sigma(m_i, r)| \ge m_i/r\right) \le E\left[m_i - |X_\sigma(m_i, r)|\right](r/m_i) \le (m_i/r^2)(r/m_i) = 1/r.$$

Therefore,

$$\kappa_i\left(\{\sigma \in \mathrm{Sym}(m_i)^G : |X_\sigma(m_i, r)| > m_i(1 - 1/r)\}\right) \ge 1 - 1/r.$$

Note that each $\sigma \in \mathrm{Sym}(m_i)^G$ with $|X_\sigma(m_i, r)| > m_i(1 - 1/r)$ is an $r$-approximation to
$(G, S)$.
By the previous Proposition, there exist a finite sequence $\{T_1, \ldots, T_n\}$ of finite sets $T_j \subset G$
and a vector $\alpha = (\alpha_1, \ldots, \alpha_n)$ of positive natural numbers such that the following holds.
Every $T_j$ satisfies $|ST_j \,\Delta\, T_j| < |T_j|/r$ and for every map $\sigma : G \to \mathrm{Sym}(m_i)$ which is an
$r$-approximation to $(G, S)$ there is an integer $k_i(\sigma)$ such that $\sigma$ is $(S, \frac{1}{r})$-close to $k_i(\sigma)\alpha \cdot
(\sigma_1, \ldots, \sigma_n)$ where $\sigma_j$ is a copy of $T_j$ for $1 \le j \le n$.

We would like to choose the integers $k_i(\sigma)$ to be independent of $\sigma$ (although they must
depend on $i$). So let $k_i$ be the minimum of $k_i(\sigma)$ over all maps $\sigma : G \to \mathrm{Sym}(m_i)$ which are
$r$-approximations to $(G, S)$. We claim that each such $\sigma$ is $(S, \frac{3}{r})$-close to $k_i\alpha \cdot (\sigma_1, \ldots, \sigma_n)$.

To prove the claim, first note that if $\sigma : G \to \mathrm{Sym}(m)$ and $\sigma' : G \to \mathrm{Sym}(m')$ are any
two maps that are $(S, \frac{1}{r})$-close to each other then $1 - 1/r \le \frac{m}{m'} \le \frac{1}{1 - 1/r}$. So let $q$ be the
integer such that $\alpha \cdot (\sigma_1, \ldots, \sigma_n)$ is a map from $G$ into $\mathrm{Sym}(q)$. Then if $\sigma$ is any map from
$G$ into $\mathrm{Sym}(m_i)$ which is an $r$-approximation to $(G, S)$ then $1 - 1/r \le \frac{k_i(\sigma)q}{m_i} \le \frac{1}{1 - 1/r}$. Since
this is true for every such $\sigma$, it follows that for any such $\sigma$,

$$1 - 2/r \le (1 - 1/r)^2 \le \frac{k_i q}{k_i(\sigma) q}.$$

Clearly, $k_i(\sigma)\alpha \cdot (\sigma_1, \ldots, \sigma_n)$ and $k_i\alpha \cdot (\sigma_1, \ldots, \sigma_n)$ are $(S, \epsilon)$-close for any $\epsilon$ with $1 - \epsilon \le \frac{k_i q}{k_i(\sigma) q}$.
So we can choose $\epsilon = 2/r$.

It follows readily from the definition of closeness that if $\sigma$ is $(S, \epsilon)$-close to $\sigma'$, $\sigma'$ is $(S, \epsilon')$-close
to $\sigma''$ and $\epsilon + \epsilon' < 1$ then $\sigma$ is $(S, \epsilon + \epsilon')$-close to $\sigma''$. So: if $\sigma : G \to \mathrm{Sym}(m_i)$ is an
$r$-approximation to $(G, S)$ then it is $(S, 1/r)$-close to $k_i(\sigma)\alpha \cdot (\sigma_1, \ldots, \sigma_n)$ which is $(S, 2/r)$-close
to $k_i\alpha \cdot (\sigma_1, \ldots, \sigma_n)$. Thus $\sigma$ is $(S, 3/r)$-close to $k_i\alpha \cdot (\sigma_1, \ldots, \sigma_n)$ (whenever $r > 3$).

Let $\tau_i = k_i\alpha \cdot (\sigma_1, \ldots, \sigma_n)$. From the claim it follows that $\kappa_i$ is $(S, 3/r)$-close to the
probability measure supported on $\{\tau_i\}$.

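By choosing $N$ larger if necessary, we may assume that for $i > N$, there is (by a similar
argument) an integer $k'_i$ such that if $\tau'_i = k'_i\alpha \cdot (\sigma_1, \ldots, \sigma_n)$ then $\lambda_i$ is $(S, \frac{3}{r})$-close to the
probability measure supported on $\{\tau'_i\}$. Note $1 - 3/r \le \frac{k_i q}{m_i} \le \frac{1}{1 - 3/r}$, and similarly for $\frac{k'_i q}{m'_i}$. So

$$(1 - 3/r)^2 \min(m_i/m'_i, m'_i/m_i) \le \frac{k_i}{k'_i} \le \frac{1}{(1 - 3/r)^2 \min(m_i/m'_i, m'_i/m_i)}.$$

This implies $\tau_i$ and $\tau'_i$ are $(S, 1 - (1 - 3/r)^2 \min(m_i/m'_i, m'_i/m_i))$-close. So $\kappa_i$ and $\lambda_i$ are
$(S, \frac{6}{r} + 1 - (1 - 3/r)^2 \min(m_i/m'_i, m'_i/m_i))$-close to each other. Since $\lim_{i \to \infty} \frac{m_i}{m'_i} = 1$ and $r$
can be made arbitrarily large, we have shown that for every $\epsilon > 0$ there exists an $N$ such
that $i > N$ implies $\kappa_i$ and $\lambda_i$ are $(S, \epsilon)$-close, which implies the theorem.

□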

Lemma 4.5. Let $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ be a random sofic approximation of $\mathbb{Z}$. Let $\mathbf{X} = (T, X, \mu, \phi)$
be a process over $\mathbb{Z}$ with finite range $A$. Then $\underline{h}(\mathbb{K}, \mathbf{X}) = \overline{h}(\mathbb{K}, \mathbf{X})$.

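Proof. By Theorem 4.3 and Theorem 4.2 it suffices to show $\underline{h}(\Sigma, \phi) = \overline{h}(\Sigma, \phi)$ where $\Sigma =
\{\sigma_i\}_{i=1}^\infty$ and $\sigma_i : \mathbb{Z} \to \mathrm{Sym}(m_i)$ is the homomorphism with $\sigma_i(1) = (1, 2, \ldots, m_i)$.

By Corollary 2.2 it suffices to show that $\underline{h}(\Sigma, \phi) \ge \overline{h}(\Sigma, \phi)$. Let $\epsilon > 0$ and $W \subset \mathbb{Z}$ be
finite. It suffices to show that

$$\liminf_{n \to \infty} \frac{\log\left(\#\{\psi : \{1, \ldots, m_n\} \to A : d_W((\sigma_n, \psi), \phi) < \epsilon\}\right)}{m_n} \ge \overline{h}(\Sigma, \phi) - \epsilon.$$

The definition of $\overline{h}(\Sigma, \phi)$ implies there exists an $i$ and a probability measure $\nu_i$ on $A^{[m_i]}$
such that if $\delta_i$ is the probability measure supported on the singleton $\{\sigma_i\}$ and $\mathbf{X}_i$ is the
approximate process constructed from $\eta_i := \delta_i \times \nu_i$ then

• $d_W(\mathbf{X}_i, \mathbf{X}) < \epsilon/2$;

• $h(\mathbf{X}_i) \ge \overline{h}(\Sigma, \phi) - \epsilon/2$;

• there is an $N > 0$ such that $W \subset [-N, N]$ and $\frac{2N}{m_i} < \epsilon/2$.

After perturbing $\nu_i$ if necessary, we may assume that there is an integer $d > 0$ such that $d\nu_i$
is integral (i.e., $d\nu_i(\{\xi\}) \in \mathbb{Z}$ for all $\xi \in A^{[m_i]}$). We may also assume that $\nu_i$ is $\sigma_i$-invariant by
replacing it with $\frac{1}{m_i}\sum_{j=1}^{m_i} \sigma_i(1)^j_*\nu_i$ if necessary.

Let $n > i$ be a large number (to be specified later). Let $k = \lfloor \frac{m_n}{d m_i} \rfloor$. We will say that
a function $\psi : [m_n] \to A$ is good if for every $\xi \in A^{[m_i]}$, the number of $j$ with $0 \le j < kd$
satisfying

$$\psi(j m_i + p) = \xi(p), \quad \forall\, 1 \le p \le m_i$$

is exactly $kd\,\nu_i(\{\xi\})$. For such a $\psi$ let $\mathbf{X}_\psi$ be the approximate process constructed from
$\delta_n \times \delta_\psi$ where $\delta_\psi$ is the probability measure concentrated on $\{\psi\} \subset A^{[m_n]}$.

In order to estimate $d_W(\mathbf{X}_\psi, \mathbf{X}_i)$, let $u_{m_n}$ be the uniform probability measure on $[m_n]$
(so $(\psi^W_{\sigma_n})_* u_{m_n}$ is the $W$-local statistics of $\mathbf{X}_\psi$ by definition 5). If $u'_{m_n}$ is the uniform probability
measure on

$$K = \{p \in [m_n] : p = j m_i + q \text{ for some } 0 \le j < kd \text{ and some } 1 + N \le q \le m_i - N\}$$

then $(\psi^W_{\sigma_n})_* u'_{m_n}$ is the $W$-local statistics of $\mathbf{X}_i$. This uses the fact that $\nu_i$ is $\sigma_i$-invariant and
$W \subset [-N, N]$. Since $|K| = kd(m_i - 2N)$, it follows that

$$d_W(\mathbf{X}_\psi, \mathbf{X}_i) \le 1 - \frac{kd(m_i - 2N)}{m_n}.$$

Let $N_0$ be large enough so that if $n > N_0$ then $1 - \frac{kd(m_i - 2N)}{m_n} < \epsilon/2$; this is possible because
$\frac{kd\,m_i}{m_n} \to 1$ as $n \to \infty$ and $\frac{2N}{m_i} < \epsilon/2$. By choice of $\mathbf{X}_i$, this
implies $d_W(\mathbf{X}_\psi, \mathbf{X}) < \epsilon$. We will now assume that $n > N_0$.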
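
The number of good functions $\psi : [m_n] \to A$ is

$$|A|^{(m_n - kd\,m_i)}\,(kd)!\left(\prod_{\xi \in A^{[m_i]}} \left(kd\,\nu_i(\{\xi\})\right)!\right)^{-1}.$$

Stirling's formula implies that

$$\lim_{n \to \infty} \frac{1}{m_n}\log\left[|A|^{(m_n - kd\,m_i)}\,(kd)!\left(\prod_{\xi \in A^{[m_i]}} \left(kd\,\nu_i(\{\xi\})\right)!\right)^{-1}\right] = \frac{H(\nu_i)}{m_i}.$$

Therefore,

$$\liminf_{n \to \infty} \frac{\log\left(\#\{\psi : \{1, \ldots, m_n\} \to A : d_W((\sigma_n, \psi), \phi) < \epsilon\}\right)}{m_n} \ge \frac{H(\nu_i)}{m_i} = h(\mathbf{X}_i) \ge \overline{h}(\Sigma, \phi) - \epsilon.$$

Because $\epsilon > 0$ is arbitrary, this implies the lemma.

□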
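
The Stirling estimate used above says that the logarithm of a multinomial coefficient with fixed proportions, divided by the number of blocks, converges to the Shannon entropy of those proportions. The following numerical sketch checks this for a toy distribution; the distribution `nu` and the function names are assumptions made for illustration, and `math.lgamma` is used to evaluate logarithms of factorials.

```python
import math


def log_multinomial(counts):
    """log( N! / prod_c c! ) for N = sum(counts), computed via lgamma."""
    total = sum(counts)
    return math.lgamma(total + 1) - sum(math.lgamma(c + 1) for c in counts)


def shannon_entropy(probs):
    """H = -sum p log p, with the convention 0 log 0 = 0."""
    return -sum(p * math.log(p) for p in probs if p > 0)


nu = [0.25, 0.75]                       # assumed toy distribution; 4 * nu is integral
for blocks in (4 * 10**2, 4 * 10**4, 4 * 10**6):
    counts = [round(blocks * p) for p in nu]
    # normalized log-count of arrangements versus the entropy H(nu)
    print(blocks, log_multinomial(counts) / blocks, shannon_entropy(nu))
```

In the proof the number of blocks is $kd$ and each block has length $m_i$, which is where the normalization $H(\nu_i)/m_i$ comes from.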



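Lemma 4.6. Let $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ be a random sofic approximation of $\mathbb{Z}$. Let $\mathbf{X} = (T, X, \mu, \phi)$
be a process over $\mathbb{Z}$. Then $\overline{h}(\mathbb{K}, \mathbf{X}) \ge h(\mathbf{X})$.

Proof. By Theorems 4.3 and 4.2 and the previous lemma it suffices to show that $\overline{h}(\Sigma, \phi) \ge
h(\mathbf{X})$ where $\Sigma = \{\sigma_i\}_{i=1}^\infty$ and $\sigma_i : \mathbb{Z} \to \mathrm{Sym}(m_i)$ is the homomorphism with $\sigma_i(1) =
(1, 2, \ldots, m_i)$.

Let $\phi^{[m_i]} : X \to A^{[m_i]}$ be the map $\phi^{[m_i]}(x)(p) = \phi(T_p x)$ for all $x \in X$, $p \in [m_i]$. Let $\eta_i := \phi^{[m_i]}_*\mu$
be the pushforward measure on $A^{[m_i]}$. Let $\mathbf{X}_i$ be the approximate process constructed from
$\delta_i \times \eta_i$ where $\delta_i$ is the probability measure concentrated on $\{\sigma_i\} \subset \mathrm{Sym}(m_i)^{\mathbb{Z}}$. The Følner
property of the sequence of intervals $[m_i] \subset \mathbb{Z}$ implies that $\lim_{i \to \infty} \mathbf{X}_i = \mathbf{X}$. The definition
of $h(\mathbf{X})$ implies $\lim_{i \to \infty} h(\mathbf{X}_i) = h(\mathbf{X})$. This implies the lemma. □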

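Proof of Proposition 4.1. By Theorems 4.3, 4.2 and the previous lemmas, it suffices to prove
that if $\Sigma = \{\sigma_i\}_{i=1}^\infty$ where $\sigma_i : \mathbb{Z} \to \mathrm{Sym}(m_i)$ is the homomorphism with $\sigma_i(1) = (1, 2, \ldots, m_i)$
then $\overline{h}(\Sigma, \mathbf{X}) \le h(\mathbf{X})$. Let $\{\nu_i\}_{i=1}^\infty$ be a sequence of probability measures on $A^{[m_i]}$ such that
if $\{\mathbf{X}_i\}_{i=1}^\infty$ is the sequence of approximate processes constructed from $\eta_i := \delta_i \times \nu_i$ (where $\delta_i$
is the probability measure concentrated on $\{\sigma_i\} \subset \mathrm{Sym}(m_i)^{\mathbb{Z}}$) then

$$\lim_{i \to \infty} \mathbf{X}_i = \mathbf{X}, \quad \text{and} \quad \lim_{i \to \infty} h(\mathbf{X}_i) = \overline{h}(\Sigma, \mathbf{X}).$$

Let $\mathbf{X} = (T, X, \mu, \phi)$ where $\phi : X \to A$. Using a standard trick, $\mathbf{X}$ is equivalent to a process
of the form $(\tau, A^{\mathbb{Z}}, \mu', \phi')$ where $\tau : A^{\mathbb{Z}} \to A^{\mathbb{Z}}$ is the shift map $\tau(y)(n) := y(n + 1)$ and
$\phi' : A^{\mathbb{Z}} \to A$ is the time-0 projection $\phi'(y) := y(0)$. To be precise, let $\phi^{\mathbb{Z}} : X \to A^{\mathbb{Z}}$ be
the map $\phi^{\mathbb{Z}}(x)(n) := \phi(T_n x)$ for $x \in X$, $n \in \mathbb{Z}$. This map is equivariant. $\mathbf{X}$ is equivalent
to $(\tau, A^{\mathbb{Z}}, \mu', \phi')$ where $\mu'$ is the pushforward measure $\phi^{\mathbb{Z}}_*\mu$. So without loss of generality, we
will assume that $X = A^{\mathbb{Z}}$, $T = \tau$ is the shift map and $\phi : A^{\mathbb{Z}} \to A$ is the time-0 projection.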
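
Let $\pi_n : (A^{[m_n]})^{\mathbb{Z}} \to A^{\mathbb{Z}}$ be the map defined by

$$\pi_n(x)(i m_n + j) := x(i)(j), \quad \forall x \in (A^{[m_n]})^{\mathbb{Z}},\ i \in \mathbb{Z},\ j \in [m_n].$$

Let $\nu_n^{\mathbb{Z}}$ be the measure on $(A^{[m_n]})^{\mathbb{Z}}$ equal to the product of $\mathbb{Z}$-copies of $\nu_n$. Let $\mu'_n = (\pi_n)_*\nu_n^{\mathbb{Z}}$
be the pushforward measure on $A^{\mathbb{Z}}$. Let

$$\mu_n := \frac{1}{m_n}\sum_{j=1}^{m_n} \tau^j_*\mu'_n.$$

Note that $\mu_n$ is $\tau$-invariant.

For an interval $[a, b] \subset \mathbb{Z}$, let $\phi^{[a,b]} : A^{\mathbb{Z}} \to A^{[a,b]}$ be the projection map. By concavity of
entropy,

$$h_{\mu_n}(\phi) = \lim_{N \to \infty} \frac{H\left(\phi^{[-m_n N + 1,\, m_n N]}_*\mu_n\right)}{2 m_n N} \ge \lim_{N \to \infty} \frac{1}{2 m_n N}\cdot\frac{1}{m_n}\sum_{j=1}^{m_n} H\left(\phi^{[-m_n N + 1,\, m_n N]}_*\tau^j_*\mu'_n\right) = \lim_{N \to \infty} \frac{2N H(\nu_n)}{2 m_n N} = \frac{H(\nu_n)}{m_n}.$$

Since $\lim_{i \to \infty} \mathbf{X}_i = \mathbf{X}$, it follows that $\lim_{n \to \infty} \mu_n = \mu$ in the weak* topology on $M(A^{\mathbb{Z}})$, the
space of all $\tau$-invariant Borel probability measures on $A^{\mathbb{Z}}$. It is well-known that the function
$\lambda \in M(A^{\mathbb{Z}}) \mapsto h_\lambda(\phi)$ is upper semi-continuous on $M(A^{\mathbb{Z}})$. For example, see [Gl03, Lemma
15.1 page 270]. It follows that

$$\overline{h}(\Sigma, \phi) = \limsup_{n \to \infty} \frac{H(\nu_n)}{m_n} \le \limsup_{n \to \infty} h_{\mu_n}(\phi) \le h_\mu(\phi)$$

as required. □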



5 Relative entropy 

Definition 16 (Factors of approximate processes). Given an approximate process $\mathbf{X} =
(T, X, \mu, \phi)$ with $\phi : X \to A$ and a function $\beta : A \to B$, let $\beta \circ \mathbf{X}$ be the approximate process

$$\beta \circ \mathbf{X} := (T, X, \mu, \beta \circ \phi).$$

If an approximate process $\mathbf{Y}$ is constructed from a measure $\eta_i$ on $\mathrm{Sym}(m_i)^G \times A^{[m_i]}$ as in
definition 6 then $\beta \circ \mathbf{Y}$ has an alternative description as follows. Let

$$\tilde{\beta} : \mathrm{Sym}(m_i)^G \times A^{[m_i]} \to \mathrm{Sym}(m_i)^G \times B^{[m_i]}, \quad \tilde{\beta}(\sigma, \xi) = (\sigma, \beta \circ \xi).$$

Then $\beta \circ \mathbf{Y}$ is equivalent to the process $\mathbf{Z}$ constructed from the pushforward measure $\tilde{\beta}_*\eta_i$.
(By equivalent, we mean that $d_W(\beta \circ \mathbf{Y}, \mathbf{Z}) = 0$ for every finite $W \subset G$ in the notation of
definition 5.)

The next lemma follows immediately from the definitions.

Lemma 5.1. If $\{\mathbf{X}_i\}_{i=1}^\infty$ is a sequence of approximate processes with range $A$, $\lim_{i \to \infty} \mathbf{X}_i = \mathbf{X}$
and $\beta : A \to B$ is a map then $\lim_{i \to \infty} \beta \circ \mathbf{X}_i = \beta \circ \mathbf{X}$. Moreover, if $\lim_{i \to \infty} \mathbf{X}_i = \mathbf{X}$ strongly
then $\lim_{i \to \infty} \beta \circ \mathbf{X}_i = \beta \circ \mathbf{X}$ strongly.

Definition 17 (Relative entropy). Let $G$ be a countable amenable group acting by measure-preserving
transformations on a standard probability space $(X, \mathcal{B}, \mu)$. Let $\phi : X \to A$ be a
finite observable, $\Sigma$ be a sofic approximation to $G$ and $\mathcal{F} \subset \mathcal{B}$ be a $G$-invariant $\sigma$-algebra.
Define

$$h_\mu(\phi \mid \mathcal{F}) := \inf_\psi\, \left[h_\mu(\phi \vee \psi) - h_\mu(\psi)\right]$$

where the infimum is over all finite-range $\mathcal{F}$-measurable observables $\psi : X \to B$ and $\phi \vee \psi :
X \to A \times B$ is the observable $\phi \vee \psi(x) = (\phi(x), \psi(x))$. In case $\mathcal{F}$ is the $G$-invariant $\sigma$-algebra
generated by an observable $\psi$ then we write $h_\mu(\phi \mid \psi) = h_\mu(\phi \mid \mathcal{F})$. In case $\mathbf{X} = (T, X, \mu, \phi)$ is a
process over $G$ and $\psi = \beta \circ \phi$ for some $\beta : A \to B$, we write $h(\mathbf{X} \mid \beta \circ \mathbf{X}) = h(\mathbf{X}) - h(\beta \circ \mathbf{X}) =
h_\mu(\phi \mid \psi)$. The first equality holds from the Abramov-Rohlin formula [WZ92].

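We can now define relative sofic entropy (in a special case).

Definition 18 (Relative sofic entropy). Let $\mathbb{K}$ be a random sofic approximation to $G$. Let
$\mathbf{X}$ be a $G$-process with finite range $A$ and $\beta : A \to B$ a map. Define

$$\overline{h}(\mathbb{K}, \mathbf{X} \mid \beta \circ \mathbf{X}) = \sup \limsup_{j \to \infty}\, h(\mathbf{X}_j) - h(\beta \circ \mathbf{X}_j)$$

where the supremum is over all sequences $\{\mathbf{X}_j\}_{j=1}^\infty$ adapted to $\mathbb{K}'$ (where $\mathbb{K}'$ is a subsequence
of $\mathbb{K}$) such that $\lim_{j \to \infty} \mathbf{X}_j = \mathbf{X}$. Similarly, let

$$\underline{h}(\mathbb{K}, \mathbf{X} \mid \beta \circ \mathbf{X}) = \sup \limsup_{j \to \infty}\, h(\mathbf{X}_j) - h(\beta \circ \mathbf{X}_j)$$

where the supremum is over all sequences $\{\mathbf{X}_j\}_{j=1}^\infty$ adapted to $\mathbb{K}'$ (where $\mathbb{K}'$ is a subsequence
of $\mathbb{K}$) such that $\lim_{j \to \infty} \mathbf{X}_j = \mathbf{X}$ strongly. If $\mathbf{X} = (T, X, \mu, \phi)$ and $\psi = \beta \circ \phi$ then an
alternative notation for relative entropy is:

$$\overline{h}(\mathbb{K}, \phi \mid \psi) := \overline{h}(\mathbb{K}, \mathbf{X} \mid \beta \circ \mathbf{X}), \quad \underline{h}(\mathbb{K}, \phi \mid \psi) := \underline{h}(\mathbb{K}, \mathbf{X} \mid \beta \circ \mathbf{X}).$$

We may also write $\overline{h}_\mu(\mathbb{K}, \phi \mid \psi)$ or $\underline{h}_\mu(\mathbb{K}, \phi \mid \psi)$ if it is desirable to emphasize the dependence
on the measure $\mu$.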

Before moving on, it is worthwhile to record some inequalities relating the entropy of
direct products to the entropies of their direct factors. To be precise, if $\mathbf{X} = (T, X, \mu, \phi)$,
$\mathbf{Y} = (S, Y, \nu, \psi)$ are two approximate processes over $G$ then their direct product is the process
$\mathbf{X} \times \mathbf{Y} := (T \times S, X \times Y, \mu \times \nu, \phi \times \psi)$ where

$$(T \times S)_g(x, y) := (T_g x, S_g y) \quad \forall g \in G,\ (x, y) \in X \times Y.$$

Let $\pi_B : A \times B \to B$ and $\pi_A : A \times B \to A$ be the projection maps. To simplify, we let
(for example)

$$\overline{h}(\mathbb{K}, \mathbf{X} \times \mathbf{Y} \mid \mathbf{Y}) := \overline{h}(\mathbb{K}, \mathbf{X} \times \mathbf{Y} \mid \pi_B \circ (\mathbf{X} \times \mathbf{Y})).$$

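Lemma 5.2. If X, Y are two processes over G as above and $\mathbb{K}$ is a random sofic approximation then

$$\overline{h}(\mathbb{K}, X \times Y) \le \overline{h}(\mathbb{K}, X) + \overline{h}(\mathbb{K}, Y),$$
$$\overline{h}(\mathbb{K}, X \times Y|Y) \le \overline{h}(\mathbb{K}, X).$$

Similar statements hold with lower-sofic entropy in place of upper-sofic entropy.

Proof. Let $X = (T, X, \mu, \phi)$ and $Y = (S, Y, \nu, \psi)$. Let $\{Z_i\}_{i=1}^\infty$ be a sequence of approximate processes adapted to $\mathbb{K}$ so that $\lim_{i\to\infty} Z_i = X \times Y$. Then $\{\pi_A \circ Z_i\}_{i=1}^\infty$ converges to X and $\{\pi_B \circ Z_i\}_{i=1}^\infty$ converges to Y. Moreover,

$$h(Z_i) \le h(\pi_A \circ Z_i) + h(\pi_B \circ Z_i).$$

This and Lemma 5.1 imply

$$\overline{h}(\mathbb{K}, X \times Y) \le \overline{h}(\mathbb{K}, X) + \overline{h}(\mathbb{K}, Y), \qquad \overline{h}(\mathbb{K}, X \times Y|Y) \le \overline{h}(\mathbb{K}, X).$$

The proofs for lower-sofic entropy are similar. □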
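
The key step in the proof above is subadditivity of entropy under projection to the two coordinates. The following is a minimal sketch of that inequality for a single finite joint distribution, not for approximate processes themselves; the helper names `entropy` and `marginal` and the sample distribution are illustrative assumptions, not notation from the paper.

```python
import math
from collections import Counter

def entropy(dist):
    """Shannon entropy (natural log) of a dict mapping outcomes to probabilities."""
    return -sum(p * math.log(p) for p in dist.values() if p > 0)

def marginal(joint, axis):
    """Marginal of a joint distribution on pairs (a, b); axis 0 keeps a, axis 1 keeps b."""
    out = Counter()
    for pair, p in joint.items():
        out[pair[axis]] += p
    return dict(out)

# Hypothetical joint distribution on A x B with correlated coordinates.
joint = {("a0", "b0"): 0.4, ("a0", "b1"): 0.1,
         ("a1", "b0"): 0.1, ("a1", "b1"): 0.4}

h_joint = entropy(joint)          # plays the role of h(Z_i)
h_a = entropy(marginal(joint, 0)) # plays the role of h(pi_A o Z_i)
h_b = entropy(marginal(joint, 1)) # plays the role of h(pi_B o Z_i)

# Subadditivity, and the relative form h(Z | pi_B Z) <= h(pi_A Z).
print(h_joint <= h_a + h_b, h_joint - h_b <= h_a)
```

Both comparisons are the same inequality rearranged, which mirrors how the two statements of the lemma follow from one estimate.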

If G is non-amenable, then there are examples showing that some of the inequalities of the lemma above can be strict. However, in the special case of Bernoulli actions, we have equality. To be precise, let $(\Omega, \omega)$ be a standard probability space. G acts on the product space $(\Omega^G, \omega^G)$ by $T_g y(f) = y(g^{-1}f)$ for $y \in \Omega^G$, $g, f \in G$. Let $\phi : \Omega^G \to \Omega$ be the map $\phi(y) = y(e)$. The process $(T, \Omega^G, \omega^G, \phi)$ is the Bernoulli process over G with base $(\Omega, \omega)$.

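Lemma 5.3. If X is any finite-range process over G, Y is a Bernoulli process with base $(\Omega, \omega)$ (where $\Omega$ is finite) and $\mathbb{K}$ is a random sofic approximation to G then

$$\overline{h}(\mathbb{K}, X \times Y) = \overline{h}(\mathbb{K}, X) + \overline{h}(\mathbb{K}, Y),$$
$$\overline{h}(\mathbb{K}, X \times Y|Y) = \overline{h}(\mathbb{K}, X).$$

Similar statements hold with lower-sofic entropy in place of upper-sofic entropy.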

Proof. The first statement above was proven in [Bo10a] for non-random sofic approximations and with lower-sofic entropy in place of upper-sofic entropy. We will handle here only the case of upper-sofic entropy as the other cases are similar.

Let $\{X_j\}_{j=1}^\infty$ be a sequence of approximate processes adapted to $\mathbb{K}' = \{\kappa_j\}_{j=1}^\infty$ (where $\mathbb{K}'$ is a subsequence of $\mathbb{K}$) such that $\lim_{j\to\infty} X_j = X$ and $\lim_{j\to\infty} h(X_j) = \overline{h}(\mathbb{K}, X)$. Let $\eta_j$ be the probability measure on $\mathrm{Sym}(m_j)^G \times A^{[m_j]}$ from which $X_j$ is constructed (as in Definition 6). Let $\tilde\eta_j = \eta_j \times \omega^{[m_j]}$ be a probability measure on $\mathrm{Sym}(m_j)^G \times A^{[m_j]} \times \Omega^{[m_j]} = \mathrm{Sym}(m_j)^G \times (A \times \Omega)^{[m_j]}$ and let $Z_j$ be the approximate process constructed from $\tilde\eta_j$. Note $h(Z_j) = h(X_j) + H(\omega)$.

We claim that $\lim_{j\to\infty} Z_j = X \times Y$. To see this, let $V \subset G$ be finite. Let $\mathcal{G}_j(V)$ be the set of all $(\sigma, p) \in \mathrm{Sym}(m_j)^G \times [m_j]$ such that $\sigma(g)\sigma(h)p = \sigma(gh)p$ for all $g, h \in V$ and for every $g \ne h$ with $g, h \in V$, $\sigma(g)p \ne \sigma(h)p$. We consider $\mathcal{G}_j(V) \times (A \times \Omega)^{[m_j]}$ as a subset of $\mathrm{Sym}(m_j)^G \times (A \times \Omega)^{[m_j]} \times [m_j]$ in the obvious way. Then $\lim_{j\to\infty} \tilde\eta_j \times u_{m_j}(\mathcal{G}_j(V) \times (A \times \Omega)^{[m_j]}) = 1$, where $u_{m_j}$ denotes the uniform probability measure on $[m_j]$, because $\mathbb{K}$ is a sofic approximation.

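In general, if $\tau$ is a measure on a set J and $J_0 \subset J$ then we write $\tau|J_0$ to denote $\tau$ restricted to $J_0$. Let $\chi_j : \mathrm{Sym}(m_j)^G \times (A \times \Omega)^{[m_j]} \times [m_j] \to A \times \Omega$ be the projection map $\chi_j(\sigma, \xi, p) = \xi(p)$. Then the measure $(\chi_j^V)_*(\tilde\eta_j \times u_{m_j}|\mathcal{G}_j(V) \times (A \times \Omega)^{[m_j]})$ splits as a product $\tau_j \times \omega^V$ for some measure $\tau_j$ on $A^V$. Let $X = (T, X, \mu, \psi)$. Thus,

$$\begin{aligned}
\lim_{j\to\infty} d_V(Z_j, X \times Y) &= \lim_{j\to\infty} \tfrac{1}{2}\big\|(\chi_j^V)_*(\tilde\eta_j \times u_{m_j}) - (\psi \times \phi)^V_*(\mu \times \omega^G)\big\|_1 \\
&= \lim_{j\to\infty} \tfrac{1}{2}\big\|(\chi_j^V)_*(\tilde\eta_j \times u_{m_j}|\mathcal{G}_j(V) \times (A \times \Omega)^{[m_j]}) - (\psi \times \phi)^V_*(\mu \times \omega^G)\big\|_1 \\
&= \lim_{j\to\infty} \tfrac{1}{2}\big\|\tau_j - \psi^V_*\mu\big\|_1 \\
&= \lim_{j\to\infty} d_V(X_j, X) = 0.
\end{aligned}$$

The first equality holds by definition of $d_V$. The second one holds because $\lim_{j\to\infty} \tilde\eta_j \times u_{m_j}(\mathcal{G}_j(V) \times (A \times \Omega)^{[m_j]}) = 1$. The third equality holds because $(\chi_j^V)_*(\tilde\eta_j \times u_{m_j}|\mathcal{G}_j(V) \times (A \times \Omega)^{[m_j]})$ splits as a product $\tau_j \times \omega^V$ and $(\psi \times \phi)^V_*(\mu \times \omega^G)$ splits as the product $\psi^V_*\mu \times \omega^V$. The last equality holds because $\lim_{j\to\infty} \eta_j \times u_{m_j}(\mathcal{G}_j(V) \times A^{[m_j]}) = 1$ since $\mathbb{K}$ is a sofic approximation.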

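Since $\lim_{j\to\infty} Z_j = X \times Y$, it follows that

$$\overline{h}(\mathbb{K}, X \times Y) \ge \overline{h}(\mathbb{K}, X) + H(\omega), \qquad \overline{h}(\mathbb{K}, X \times Y|Y) \ge \overline{h}(\mathbb{K}, X).$$

By the previous lemma, it now suffices to prove that $\overline{h}(\mathbb{K}, Y) = H(\omega)$.

Applying the equation above to the case when X is trivial, we see that $\overline{h}(\mathbb{K}, Y) \ge H(\omega)$. Suppose that $\{Y_j\}_{j=1}^\infty$ is a sequence of approximate processes adapted to $\mathbb{K}' = \{\kappa_j\}_{j=1}^\infty$ (where $\mathbb{K}'$ is a subsequence of $\mathbb{K}$) such that $\lim_{j\to\infty} Y_j = Y$ and $\lim_{j\to\infty} h(Y_j) = \overline{h}(\mathbb{K}, Y)$. Let $\eta_j$ be the probability measure on $\mathrm{Sym}(m_j)^G \times \Omega^{[m_j]}$ from which $Y_j$ is constructed. Let $\pi : \mathrm{Sym}(m_j)^G \times \Omega^{[m_j]} \times [m_j] \to \Omega$ be the map $\pi(\sigma, \xi, p) = \xi(p)$ and let $\omega_j = \pi_*(\eta_j \times u_{[m_j]})$ be the pushforward measure. Because $\lim_{j\to\infty} Y_j = Y$, $\omega_j$ converges to $\omega$.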

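We claim that $h(Y_j) \le H(\omega_j)$. Let $\eta_{j,\sigma}$ be the fiber measure of $\eta_j$ over $\sigma \in \mathrm{Sym}(m_j)^G$. This is a measure on $\Omega^{[m_j]}$. By abuse of notation we let $\pi : \Omega^{[m_j]} \times [m_j] \to \Omega$ denote the map $\pi(\xi, p) = \xi(p)$ and for $p \in [m_j]$ let $\pi_p : \Omega^{[m_j]} \to \Omega$ denote the map $\pi_p(\xi) = \xi(p)$.
By concavity of the entropy function,

$$\frac{1}{m_j}\sum_{p \in [m_j]} H\big((\pi_p)_*\eta_{j,\sigma}\big) \le H\Big(\frac{1}{m_j}\sum_{p \in [m_j]} (\pi_p)_*\eta_{j,\sigma}\Big) = H\big(\pi_*(\eta_{j,\sigma} \times u_{[m_j]})\big).$$

So by concavity of entropy again (and the subadditivity bound $H(\eta_{j,\sigma}) \le \sum_{p} H((\pi_p)_*\eta_{j,\sigma})$),

$$h(Y_j) = \frac{1}{m_j}\int H(\eta_{j,\sigma})\, d\kappa_j(\sigma) \le \int H\big(\pi_*(\eta_{j,\sigma} \times u_{[m_j]})\big)\, d\kappa_j(\sigma) \le H\Big(\int \pi_*(\eta_{j,\sigma} \times u_{[m_j]})\, d\kappa_j(\sigma)\Big) = H(\omega_j).$$

So

$$\overline{h}(\mathbb{K}, Y) = \lim_{j\to\infty} h(Y_j) \le \limsup_{j\to\infty} H(\omega_j) = H(\omega).$$

Since we have already shown that $\overline{h}(\mathbb{K}, Y) \ge H(\omega)$, we now know that $\overline{h}(\mathbb{K}, Y) = H(\omega)$.

□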
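
The proof above invokes concavity of entropy twice: the entropy of an average of measures is at least the average of their entropies. As a hedged illustration, the sketch below checks this for a small, hypothetical family of probability vectors; the names `entropy` and `mixture` and all numbers are assumptions made for the example.

```python
import math

def entropy(p):
    """Shannon entropy (natural log) of a probability vector."""
    return -sum(x * math.log(x) for x in p if x > 0)

def mixture(measures, weights):
    """Convex combination sum_i weights[i] * measures[i], coordinate by coordinate."""
    size = len(measures[0])
    return [sum(w * mu[k] for mu, w in zip(measures, weights)) for k in range(size)]

# Hypothetical measures on a 3-point set, standing in for the fiber measures.
measures = [[0.9, 0.1, 0.0], [0.2, 0.3, 0.5], [1 / 3, 1 / 3, 1 / 3]]
weights = [0.5, 0.25, 0.25]

avg_of_entropies = sum(w * entropy(mu) for mu, w in zip(measures, weights))
entropy_of_avg = entropy(mixture(measures, weights))

# Concavity: averaging first can only increase entropy.
print(avg_of_entropies <= entropy_of_avg)
```

In the proof the weights are $1/m_j$ (averaging over $p$) in the first application and $\kappa_j$ (integrating over $\sigma$) in the second.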

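Lemma 5.4. If X is any finite-range process over G, $Y = (T, \Omega^G, \omega^G, \phi)$ is a Bernoulli process with base $(\Omega, \omega)$ (where $\Omega$ is finite), $\Omega'$ is a finite set, $\phi' : \Omega^G \to \Omega'$, $\beta : \Omega' \to \Omega$ are Borel maps such that $\phi = \beta \circ \phi'$, $Y' = (T, \Omega^G, \omega^G, \phi')$ and $\mathbb{K}$ is a random sofic approximation to G then

$$\overline{h}(\mathbb{K}, X \times Y'|Y') = \overline{h}(\mathbb{K}, X), \qquad \underline{h}(\mathbb{K}, X \times Y'|Y') = \underline{h}(\mathbb{K}, X).$$

Proof. We will only prove the statement for upper sofic entropy as the statement for lower sofic entropy is similar. By Lemma 5.2 it suffices to prove $\overline{h}(\mathbb{K}, X \times Y'|Y') \ge \overline{h}(\mathbb{K}, X)$. Let $\mathbb{K} = \{\kappa_j\}_{j=1}^\infty$ where $\kappa_j$ is a measure on $\mathrm{Sym}(m_j)^G$. Let $F : \mathrm{Sym}(m_j)^G \to \mathrm{Sym}(m_j)^G$ be the map $F(\sigma)(g) = \sigma(g)$ if $g \ne e$ and $F(\sigma)(e)$ is the identity permutation. Let $\kappa'_j := F_*\kappa_j$. Observe that $\mathbb{K}' := \{\kappa'_j\}_{j=1}^\infty$ is asymptotic to $\mathbb{K}$. Therefore, by Theorem 5.5 we may assume without loss of generality that $\mathbb{K} = \mathbb{K}'$.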

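Define $X_j$, $\eta_j$, $\kappa_j$ (for $j = 1, 2, \ldots$) as in the proof of the previous lemma. For $V \subset G$ finite with $e \in V$, $\sigma \in \mathrm{Sym}(m_j)^G$ and $\zeta \in \Omega^{[m_j]}$, define $\zeta[\sigma, V] \in (\Omega^V)^{[m_j]}$ by

$$\zeta[\sigma, V](p) := [v \in V \mapsto \zeta(\sigma(v)p)].$$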

Let ^J", be the measures on Sym(m.,) G x A^ m ^ x (fi y )[ m j] obtained by pushing rjj x u;[ m jl 
forward under the maps 
respectively. Let $Y_j^V$, $Z_j^V$ be the approximate processes constructed from $\eta_j^V$, $\tilde\eta_j^V$ respectively. Also let $Y_j = Y_j^{\{e\}}$. As in the previous lemma we obtain

$$\lim_{j\to\infty} Z_j^V = X \times Y^V, \qquad \lim_{j\to\infty} Y_j^V = Y^V$$

where $Y^V = (T, \Omega^G, \omega^G, \phi^V)$.

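Suppose that $\beta' : \Omega^V \to \Omega'$ is a map such that $\beta \circ \beta' : \Omega^V \to \Omega$ is the evaluation map at the identity element. Let id denote the identity map on A. Then

$$\lim_{j\to\infty} (\mathrm{id} \times \beta') \circ Z_j^V = X \times (\beta' \circ Y^V)$$

and because the map $(\sigma, \xi, \zeta) \mapsto (\sigma, \xi, \beta'\zeta[\sigma, V])$ is injective, $h((\mathrm{id} \times \beta') \circ Z_j^V) = h(Z_j)$, which implies (by the previous lemma)

$$\lim_{j\to\infty} h((\mathrm{id} \times \beta') \circ Z_j^V) = \overline{h}(\mathbb{K}, X \times Y) = \overline{h}(\mathbb{K}, X) + H(\omega).$$

Also the map $(\sigma, \zeta) \mapsto (\sigma, \beta'\zeta[\sigma, V])$ is injective. So $h(\beta' \circ Y_j^V) = h(Y_j)$. By definition, $Y_j$ is the approximate process constructed from $\kappa_j \times \omega^{[m_j]}$. So $h(Y_j) = H(\omega)$.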

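Because $\phi = \beta \circ \phi'$, there exist a sequence $\{V_k\}_{k=1}^\infty$ of finite subsets of G (with $e \in V_k$ for all k) and a sequence of maps $\beta'_k : \Omega^{V_k} \to \Omega'$ such that $\beta \circ \beta'_k : \Omega^{V_k} \to \Omega$ is the evaluation map at the identity element and $\beta'_k \circ \phi^{V_k}$ limits on $\phi'$ in the following sense:

$$\lim_{k\to\infty} \omega^G\big(\{x \in \Omega^G : \beta'_k\phi^{V_k}(x) = \phi'(x)\}\big) = 1.$$

Note that

$$\lim_{k\to\infty}\lim_{j\to\infty} (\mathrm{id} \times \beta'_k) \circ Z_j^{V_k} = \lim_{k\to\infty} X \times (\beta'_k \circ Y^{V_k}) = X \times Y',$$
$$\lim_{k\to\infty}\lim_{j\to\infty} h\big((\mathrm{id} \times \beta'_k) \circ Z_j^{V_k}\big) = \overline{h}(\mathbb{K}, X) + H(\omega).$$

So a diagonalization argument implies that, without loss of generality, we may assume $\{V_k\}_{k=1}^\infty$ and $\{\beta'_k\}_{k=1}^\infty$ are chosen so that there is an increasing sequence $\{i(j)\}_{j=1}^\infty$ of positive integers with

$$\lim_{j\to\infty} (\mathrm{id} \times \beta'_{i(j)}) \circ Z_j^{V_{i(j)}} = X \times Y',$$
$$\lim_{j\to\infty} h\big((\mathrm{id} \times \beta'_{i(j)}) \circ Z_j^{V_{i(j)}}\big) = \overline{h}(\mathbb{K}, X) + H(\omega).$$

Let $\pi : A \times \Omega' \to \Omega'$ be the projection map. Note $\beta'_{i(j)} \circ Y_j^{V_{i(j)}} = \pi \circ (\mathrm{id} \times \beta'_{i(j)}) \circ Z_j^{V_{i(j)}}$. So $h\big(\pi \circ (\mathrm{id} \times \beta'_{i(j)}) \circ Z_j^{V_{i(j)}}\big) = H(\omega)$. Therefore,

$$\lim_{j\to\infty} h\big((\mathrm{id} \times \beta'_{i(j)}) \circ Z_j^{V_{i(j)}}\big) - h\big(\pi \circ (\mathrm{id} \times \beta'_{i(j)}) \circ Z_j^{V_{i(j)}}\big) = \overline{h}(\mathbb{K}, X).$$

Thus $\overline{h}(\mathbb{K}, X \times Y'|Y') \ge \overline{h}(\mathbb{K}, X)$ as required.

□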
Next, we extend Theorem 4.2 to relative entropy:

Theorem 5.5. Let X be a process over a group G with random sofic approximations $\mathbb{K}$ and $\mathbb{L}$. We assume A is the range of X and $\beta : A \to B$ is a map (both A and B are finite). If $\mathbb{K}$ and $\mathbb{L}$ are asymptotic then $\overline{h}(\mathbb{K}, X|\beta \circ X) = \overline{h}(\mathbb{L}, X|\beta \circ X)$ and $\underline{h}(\mathbb{K}, X|\beta \circ X) = \underline{h}(\mathbb{L}, X|\beta \circ X)$.

In order to prove this, we will need a few lemmas. We say that elements $\sigma, \sigma' \in \mathrm{Sym}(m)^G$ are conjugate if there exists an element $\tau \in \mathrm{Sym}(m)$ such that for all $g \in G$, $\sigma(g) = \tau\sigma'(g)\tau^{-1}$. We say that probability measures $\kappa, \kappa'$ on $\mathrm{Sym}(m)^G$ are conjugate if there exists a probability measure $\vartheta$ on $\mathrm{Sym}(m)^G \times \mathrm{Sym}(m)^G$ with marginals $\kappa, \kappa'$ such that $\vartheta$ is supported on the set of all conjugate pairs $(\sigma, \sigma') \in \mathrm{Sym}(m)^G \times \mathrm{Sym}(m)^G$.

Lemma 5.6. Let $\kappa, \kappa'$ be conjugate probability measures on $\mathrm{Sym}(m)^G$. Let $\eta$ be a probability measure on $\mathrm{Sym}(m)^G \times A^{[m]}$ with projection $\kappa$ and let X be the approximate process constructed from $\eta$. Let $\beta : A \to B$ be a map to a finite set B. Let $W \subset G$ be finite. Then there is an approximate process $X'$ constructed from a measure $\eta'$ on $\mathrm{Sym}(m)^G \times A^{[m]}$ such that

1. the projection of $\eta'$ to $\mathrm{Sym}(m)^G$ is $\kappa'$;

2. $d_W(X, X') = 0$;

3. $h(X'|\beta \circ X') \ge h(X|\beta \circ X)$.

Moreover, if $X = (T, X, \mu, \phi)$ is a process over G then for any $\epsilon > 0$,

$$\eta'(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}) = \eta(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}). \qquad (2)$$

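Proof. Case 1. Suppose $\kappa'$ is supported on a singleton $\{\sigma\}$. Then $\kappa$-a.e. $\sigma'$ is conjugate to $\sigma$, so $\kappa$ is supported on a finite set which we denote by $\{\sigma_1, \ldots, \sigma_n\}$. For each i there is an element $\tau_i \in \mathrm{Sym}(m)$ such that $\sigma = \tau_i\sigma_i\tau_i^{-1}$. Let $\eta'$ be the measure on $\mathrm{Sym}(m)^G \times A^{[m]}$ defined by

$$\eta'(\{(\sigma, \psi)\}) := \eta(\{(\sigma_i, \psi \circ \tau_i) : 1 \le i \le n\}).$$

Then $\eta'$ projects to $\kappa'$ and

$$\psi(\sigma(g)p) = \psi \circ \tau_i(\sigma_i(g)\tau_i^{-1}p), \quad \forall p \in [m],\ \psi \in A^{[m]},\ 1 \le i \le n,\ g \in G$$

implies $d_W(X, X') = 0$ where $X'$ is the approximate process constructed from $\eta'$. It also implies (2).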

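For $\gamma \in B^{[m]}$ with $\eta'(\{(\sigma, \psi') : \beta\psi' = \gamma\}) > 0$, let $\eta'_\gamma$ be the measure on $A^{[m]}$ defined by

$$\eta'_\gamma(\{\psi\}) = \begin{cases} \dfrac{\eta'(\{(\sigma, \psi)\})}{\eta'(\{(\sigma, \psi') : \beta\psi' = \gamma\})} & \text{if } \beta\psi = \gamma, \\ 0 & \text{otherwise.} \end{cases}$$

Then

$$h(X'|\beta \circ X') = \frac{1}{m}\sum_\gamma \eta'(\{(\sigma, \psi') : \beta\psi' = \gamma\})\, H(\eta'_\gamma).$$

If $\eta(\{(\sigma_i, \psi'\tau_i) : \beta\psi' = \gamma\}) > 0$ then let $\eta_{i,\gamma}$ be the measure on $A^{[m]}$ defined by

$$\eta_{i,\gamma}(\{\psi\}) = \begin{cases} \dfrac{\eta(\{(\sigma_i, \psi\tau_i)\})}{\eta(\{(\sigma_i, \psi'\tau_i) : \beta\psi' = \gamma\})} & \text{if } \beta\psi = \gamma, \\ 0 & \text{otherwise.} \end{cases}$$

Observe that

$$\eta'_\gamma = \sum_{i=1}^n \frac{\eta(\{(\sigma_i, \psi'\tau_i) : \beta\psi' = \gamma\})}{\eta'(\{(\sigma, \psi') : \beta\psi' = \gamma\})}\, \eta_{i,\gamma}.$$

By concavity of entropy,

$$H(\eta'_\gamma) \ge \sum_{i=1}^n \frac{\eta(\{(\sigma_i, \psi'\tau_i) : \beta\psi' = \gamma\})}{\eta'(\{(\sigma, \psi') : \beta\psi' = \gamma\})}\, H(\eta_{i,\gamma})$$

which implies

$$h(X'|\beta \circ X') \ge \frac{1}{m}\sum_{i=1}^n \sum_\gamma \eta(\{(\sigma_i, \psi'\tau_i) : \beta\psi' = \gamma\})\, H(\eta_{i,\gamma}) = h(X|\beta \circ X).$$

The last equality holds because precomposition with $\tau_i$ is a bijection of $A^{[m]}$, so it re-indexes the fibers of $\eta$ over $\sigma_i$ without changing their entropies.

This proves Case 1.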

Case 2. Suppose $\kappa$ is supported on the singleton $\{\sigma\}$. Then $\kappa'$-a.e. $\sigma'$ is conjugate to $\sigma$, so $\kappa'$ is supported on a finite set which we will denote by $\{\sigma_1, \ldots, \sigma_n\}$. For each i there is an element $\tau_i \in \mathrm{Sym}(m)$ such that $\sigma_i = \tau_i\sigma\tau_i^{-1}$. Let $\eta'$ be the measure on $\mathrm{Sym}(m)^G \times A^{[m]}$ defined by

$$\eta'(\{(\sigma_i, \psi)\}) := \kappa'(\{\sigma_i\})\, \eta(\{(\sigma, \psi \circ \tau_i)\}), \quad 1 \le i \le n.$$

Then $\eta'$ projects to $\kappa'$ and

$$\psi(\sigma_i(g)p) = \psi \circ \tau_i(\sigma(g)\tau_i^{-1}p), \quad \forall p \in [m],\ \psi \in A^{[m]},\ 1 \le i \le n,\ g \in G$$

implies $d_W(X, X') = 0$ where $X'$ is the approximate process constructed from $\eta'$. It also implies (2). If $\eta'_i$ is the probability measure obtained from $\eta'$ by restricting to $\{\sigma_i\} \times A^{[m]}$ and normalizing to have total mass 1 and if $X'_i$ is the approximate process constructed from $\eta'_i$ then X is isomorphic to $X'_i$ in the obvious sense. Thus $h(X|\beta \circ X) = h(X'_i|\beta \circ X'_i)$. So

$$h(X'|\beta \circ X') = \sum_{i=1}^n h(X|\beta \circ X)\, \kappa'(\{\sigma_i\}) = h(X|\beta \circ X).$$

This proves Case 2.

Case 3. Suppose that there is an element $\sigma \in \mathrm{Sym}(m)^G$ such that $\kappa$-a.e. $\sigma'$ is conjugate to $\sigma$. Then $\kappa'$-a.e. $\sigma'$ is also conjugate to $\sigma$. By Case 1, there exists a measure $\eta_0$ on $\mathrm{Sym}(m)^G \times A^{[m]}$ whose first marginal is the Dirac measure supported on $\{\sigma\}$ such that if $X_0$ is the approximate process constructed from $\eta_0$ then $d_W(X, X_0) = 0$, equation (2) holds with $\eta'$ replaced by $\eta_0$ and $h(X_0|\beta \circ X_0) \ge h(X|\beta \circ X)$. By Case 2, there exists a measure $\eta'$ on $\mathrm{Sym}(m)^G \times A^{[m]}$ whose first marginal is $\kappa'$ such that if $X'$ is the approximate process constructed from $\eta'$ then $d_W(X', X_0) = 0$, equation (2) holds with $\eta$ replaced by $\eta_0$ and $h(X'|\beta \circ X') \ge h(X_0|\beta \circ X_0)$. So $d_W(X', X) = 0$ and $h(X'|\beta \circ X') \ge h(X|\beta \circ X)$ and equation (2) holds.

Case 4. Now we handle the general case. This case follows from the previous one by disintegrating $\kappa$ and $\kappa'$ over the set of conjugacy classes. To be precise, let $[\mathrm{Sym}(m)^G]$ be the space of conjugacy classes of $\mathrm{Sym}(m)^G$. This is the quotient of $\mathrm{Sym}(m)^G$ by the conjugacy action of $\mathrm{Sym}(m)$. Let $\pi : \mathrm{Sym}(m)^G \to [\mathrm{Sym}(m)^G]$ be the quotient map. For each conjugacy class $c \in [\mathrm{Sym}(m)^G]$, let $\kappa_c, \kappa'_c$ be the fiber measures of $\kappa, \kappa'$ over c respectively. Likewise, let $\eta_c$ be the fiber measure of $\eta$ over c (so $\kappa_c$ is the projection of $\eta_c$ to $\mathrm{Sym}(m)^G$).

By Case 3, if $X_c$ is the approximate process constructed from $\eta_c$ then there is a measure $\eta'_c$ on $\mathrm{Sym}(m)^G \times A^{[m]}$ such that the projection of $\eta'_c$ to $\mathrm{Sym}(m)^G$ is $\kappa'_c$, if $X'_c$ is the approximate process constructed from $\eta'_c$ then $d_W(X_c, X'_c) = 0$, equation (2) holds for $\eta_c$ and $\eta'_c$ in place of $\eta$ and $\eta'$, and $h(X'_c|\beta \circ X'_c) \ge h(X_c|\beta \circ X_c)$.

The hypothesis that $\kappa$ and $\kappa'$ are conjugate implies $\pi_*\kappa = \pi_*\kappa'$. Let $\eta' = \int \eta'_c\, d\pi_*\kappa(c)$ and $X'$ be the approximate process constructed from $\eta'$. So,

$$h(X'|\beta \circ X') = \int h(X'_c|\beta \circ X'_c)\, d\pi_*\kappa(c) \ge \int h(X_c|\beta \circ X_c)\, d\pi_*\kappa(c) = h(X|\beta \circ X).$$

Because $d_W(X_c, X'_c) = 0$, it follows that $d_W(X, X') = 0$. Also equation (2) holds. This finishes the lemma.

□

Lemma 5.7. Let $\kappa, \kappa'$ be probability measures on $\mathrm{Sym}(m)^G$, $\mathrm{Sym}(m')^G$ respectively. Let $\eta$ be a probability measure on $\mathrm{Sym}(m)^G \times A^{[m]}$ with projection $\kappa$ and let X be the approximate process constructed from $\eta$. Suppose $\kappa$ and $\kappa'$ are $(W, \epsilon)$-close where $W \subset G$ is a finite set containing the identity and $0 < \epsilon < 1$. Let $\beta : A \to B$ be a map to a finite set B. Then there is an approximate process $X'$ constructed from a measure $\eta'$ on $\mathrm{Sym}(m')^G \times A^{[m']}$ such that

1. the projection of $\eta'$ to $\mathrm{Sym}(m')^G$ is $\kappa'$;

2. $d_W(X, X') \le 6|W|\epsilon$;

3. $h(X'|\beta \circ X') \ge (1 - \epsilon)^2 h(X|\beta \circ X) - 2(1 - \epsilon)^2\epsilon\log|A|$.

Moreover, if $X = (T, X, \mu, \phi)$ is a process over G then for any $\epsilon > 0$,

$$\eta'(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) \ge (1 - \epsilon)\,\eta(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}) - \epsilon.$$
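Proof. Let $\vartheta$ be as in Definition 12. So $\vartheta(\mathcal{G}(W, \epsilon)) \ge 1 - \epsilon$ where $\mathcal{G}(W, \epsilon)$ is the set of all $(\sigma, \sigma') \in \mathrm{Sym}(m)^G \times \mathrm{Sym}(m')^G$ that are $(W, \epsilon)$-close to each other. Thus, for every $(\sigma, \sigma') \in \mathcal{G}(W, \epsilon)$ there exists a bijection $\beta_{\sigma,\sigma'} : Q_{\sigma,\sigma'} \to Q'_{\sigma,\sigma'}$ between subsets $Q_{\sigma,\sigma'} \subset [m]$, $Q'_{\sigma,\sigma'} \subset [m']$ such that

1. $\sigma'(w)\beta_{\sigma,\sigma'}(q) = \beta_{\sigma,\sigma'}(\sigma(w)q)$ for all $w \in W$ and $q \in Q_{\sigma,\sigma'}$ with $\sigma(w)q \in Q_{\sigma,\sigma'}$;

2. $\sigma(w)\beta^{-1}_{\sigma,\sigma'}(q') = \beta^{-1}_{\sigma,\sigma'}(\sigma'(w)q')$ for all $w \in W$ and $q' \in Q'_{\sigma,\sigma'}$ with $\sigma'(w)q' \in Q'_{\sigma,\sigma'}$;

3. $|Q_{\sigma,\sigma'}| \ge (1 - \epsilon)m$, $|Q'_{\sigma,\sigma'}| \ge (1 - \epsilon)m'$.

After removing elements from each $Q_{\sigma,\sigma'}$ and $Q'_{\sigma,\sigma'}$ if necessary, we may assume, without loss of generality, that there exists a number $m'' \le \min(m, m')$ so that $m'' \ge (1 - \epsilon)\max(m, m')$ and $m'' = |Q_{\sigma,\sigma'}| = |Q'_{\sigma,\sigma'}|$ for every $(\sigma, \sigma') \in \mathcal{G}(W, \epsilon)$. There exist elements $\tau_{\sigma,\sigma'} \in \mathrm{Sym}(m)$ and $\tau'_{\sigma,\sigma'} \in \mathrm{Sym}(m')$ such that $\tau_{\sigma,\sigma'}Q_{\sigma,\sigma'} = [m'']$, $\tau'_{\sigma,\sigma'}Q'_{\sigma,\sigma'} = [m'']$ and $\tau'_{\sigma,\sigma'}\beta_{\sigma,\sigma'}\tau^{-1}_{\sigma,\sigma'}$ is the identity map on $[m'']$. Let $\vartheta'$ be the measure obtained from $\vartheta$ by pushing forward under the map defined by $(\sigma, \sigma') \mapsto (\tau_{\sigma,\sigma'}\sigma\tau^{-1}_{\sigma,\sigma'}, \tau'_{\sigma,\sigma'}\sigma'(\tau'_{\sigma,\sigma'})^{-1})$ for $(\sigma, \sigma') \in \mathcal{G}(W, \epsilon)$, where the map is equal to the identity on the complement of $\mathcal{G}(W, \epsilon)$. The marginals of $\vartheta'$ are conjugate to $\kappa$ and $\kappa'$ respectively. By the previous lemma, therefore, we may assume that $\vartheta' = \vartheta$.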

About notation: if $\mu$ is a measure on a space X and $\pi : X \to Y$ is a Borel map, we let $\mu_y$ denote the fiber measure of $\mu$ over $y \in Y$. It is a measure on X supported on $\pi^{-1}(y)$ and $\mu = \int \mu_y\, d\pi_*\mu(y)$. Note that $\mu_y$ depends on $\pi$ but this dependence is left implicit. If $\mu$ is a probability measure then $\mu_y$ is a probability measure for $\pi_*\mu$-a.e. $y \in Y$. If $\alpha : X \to Y$ and $\beta : Y \to Z$ and $\mu$ is a measure on X then $\mu_z = \int \mu_{\alpha(x)}\, d\mu_z(x)$ and $\alpha_*(\mu_z) = (\alpha_*\mu)_z$ for $\beta_*\alpha_*\mu$-a.e. $z \in Z$.

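Case 1. Suppose $\vartheta(\mathcal{G}(W, \epsilon)) = 1$. We will prove that there exists a probability measure $\eta'$ on $\mathrm{Sym}(m')^G \times A^{[m']}$ which projects to $\kappa'$ such that if $X'$ is the approximate process constructed from $\eta'$ then $d_W(X', X) \le 4\epsilon|W|$, $h(X'|\beta \circ X') \ge (1 - \epsilon)h(X|\beta \circ X) - (1 - \epsilon)\epsilon\log|A|$ and

$$\eta'(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) \ge \eta(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}).$$

Let $\pi : \mathrm{Sym}(m)^G \to \mathrm{Sym}(m'')^G$, $\pi' : \mathrm{Sym}(m')^G \to \mathrm{Sym}(m'')^G$ be Borel maps satisfying

1. if $\sigma \in \mathrm{Sym}(m)^G$, $w \in W$, $p \in [m'']$ and $\sigma(w)p \in [m'']$ then $\pi(\sigma)(w)p = \sigma(w)p$;

2. if $\sigma' \in \mathrm{Sym}(m')^G$, $w \in W$, $p \in [m'']$ and $\sigma'(w)p \in [m'']$ then $\pi'(\sigma')(w)p = \sigma'(w)p$;

3. if $\sigma \in \mathrm{Sym}(m)^G$ and $\sigma' \in \mathrm{Sym}(m')^G$ satisfy $\sigma(w)p = \sigma'(w)p$ for every $w \in W$ and $p \in [m'']$ such that $\sigma(w)p \in [m'']$ and $\sigma'(w)p \in [m'']$ then $\pi(\sigma) = \pi'(\sigma')$.

Because we assume $\vartheta(\mathcal{G}(W, \epsilon)) = 1$, it follows from these assumptions that $\pi_*\kappa = \pi'_*\kappa'$. Let $\kappa''$ denote $\pi_*\kappa$. Let $R : A^{[m]} \to A^{[m'']}$ be the restriction map. Let $\eta'' = (\pi \times R)_*\eta$ be the pushforward measure and $X''$ be the approximate process constructed from $\eta''$.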
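Note that if $\zeta$ is any probability measure on $A^{[m]}$ then $H(R_*\zeta) \ge H(\zeta) - (m - m'')\log|A|$. Thus,

$$h(X|\beta \circ X) = \frac{1}{m}\int H(\eta_{\sigma,\beta\psi})\, d\eta(\sigma, \psi) \le \frac{1}{m}\int H\big((\mathrm{id} \times R)_*(\eta_{\sigma,\beta\psi})\big)\, d\eta(\sigma, \psi) + \frac{m - m''}{m}\log|A|$$

where id denotes the identity map on $\mathrm{Sym}(m)^G$. By the definition of fiber measure,

$$\eta_{\sigma,\beta R\psi} = \int \eta_{\sigma',\beta\psi'}\, d\eta_{\sigma,\beta R\psi}(\sigma', \psi').$$

Therefore,

$$\int (\mathrm{id} \times R)_*(\eta_{\sigma',\beta\psi'})\, d\eta_{\sigma,\beta R\psi}(\sigma', \psi') = \big((\mathrm{id} \times R)_*\eta\big)_{\sigma,\beta R\psi}.$$

So by concavity of entropy,

$$\int H\big((\mathrm{id} \times R)_*(\eta_{\sigma',\beta\psi'})\big)\, d\eta_{\sigma,\beta R\psi}(\sigma', \psi') \le H\big(((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi}\big).$$

By integrating over all $(\sigma, \psi) \in \mathrm{Sym}(m)^G \times A^{[m]}$, we obtain

$$\int H\big((\mathrm{id} \times R)_*(\eta_{\sigma,\beta\psi})\big)\, d\eta(\sigma, \psi) \le \int H\big(((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi}\big)\, d\eta(\sigma, \psi).$$

Note $\pi \times \mathrm{id}$ is injective on the support of $((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi}$. So

$$H\big(((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi}\big) = H\big((\pi \times \mathrm{id})_*(((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi})\big).$$

Note that $((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi} = (\mathrm{id} \times R)_*(\eta_{\sigma,\beta R\psi})$. Therefore

$$(\pi \times \mathrm{id})_*\big(((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi}\big) = (\pi \times R)_*(\eta_{\sigma,\beta R\psi}).$$

Because $\int \eta_{\sigma',\beta R\psi'}\, d\eta_{\pi\sigma,\beta R\psi}(\sigma', \psi') = \eta_{\pi\sigma,\beta R\psi}$, it follows that

$$\int (\pi \times R)_*(\eta_{\sigma',\beta R\psi'})\, d\eta_{\pi\sigma,\beta R\psi}(\sigma', \psi') = (\pi \times R)_*(\eta_{\pi\sigma,\beta R\psi}) = \eta''_{\pi\sigma,\beta R\psi}.$$

By concavity of entropy,

$$\int H\big((\pi \times R)_*(\eta_{\sigma',\beta R\psi'})\big)\, d\eta_{\pi\sigma,\beta R\psi}(\sigma', \psi') \le H(\eta''_{\pi\sigma,\beta R\psi}).$$

So,

$$\int H\big(((\mathrm{id} \times R)_*\eta)_{\sigma,\beta R\psi}\big)\, d\eta(\sigma, \psi) = \int H\big((\pi \times R)_*(\eta_{\sigma,\beta R\psi})\big)\, d\eta(\sigma, \psi) \le \int H(\eta''_{\pi\sigma,\beta R\psi})\, d\eta(\sigma, \psi).$$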
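Since $\frac{m - m''}{m} \le \epsilon$,

$$\begin{aligned}
h(X|\beta \circ X) &\le \epsilon\log|A| + \frac{1}{m}\int H(\eta''_{\pi\sigma,\beta R\psi})\, d\eta(\sigma, \psi) \\
&= \epsilon\log|A| + \frac{1}{m}\int H(\eta''_{\sigma'',\beta\psi''})\, d\eta''(\sigma'', \psi'') \\
&= \epsilon\log|A| + \frac{m''}{m}\, h(X''|\beta \circ X'').
\end{aligned}$$

Next let $R' : A^{[m']} \to A^{[m'']}$ be the restriction map. We claim that there exists a probability measure $\eta'$ on $\mathrm{Sym}(m')^G \times A^{[m']}$ such that the first marginal of $\eta'$ is $\kappa'$ and $(\pi' \times R')_*\eta' = \eta''$. To see this, let $a \in A$ be an arbitrary element. For $\psi \in A^{[m'']}$, define $\psi_a \in A^{[m']}$ by $\psi_a(p) = \psi(p)$ if $p \in [m'']$ and $\psi_a(p) = a$ if $p \in [m'] \setminus [m'']$. For $(\sigma, \psi) \in \mathrm{Sym}(m'')^G \times A^{[m'']}$, let $\eta'_{\sigma,\psi}$ be the probability measure on $\mathrm{Sym}(m')^G \times A^{[m']}$ whose first marginal is $\kappa'_\sigma$ and has support contained in $\{(\sigma', \psi_a) : \pi'(\sigma') = \sigma\}$. That is, $\eta'_{\sigma,\psi}$ is the direct product of $\kappa'_\sigma$ and the Dirac measure concentrated on $\psi_a$. Then $\eta' := \int \eta'_{\sigma,\psi}\, d\eta''(\sigma, \psi)$ satisfies the claim because $\kappa'' = (\pi')_*\kappa'$ implies the first marginal of $\eta'$ is $\kappa'$ as required.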

Let a : A^ ->■ ^I" 1 '] be the map a(^) = Let p : Sym(m") G x A^ ->■ ^I" 1 "] be 
the projection map. By construction, for /t"-a.e. cr" G Sym(m") G , 77^.,, = «4„ x a*p*(r)"„). 
Therefore, for /t'-a.e. a, r]' a is the direct product of S a , the Dirac measure concentrated on er, 
with a*p*(77^ CT ). So for 7/-a.e. (cr,^), ^ >/3fl ^ = ^ x a*P*(^'' CT ,/3i?^) and = ^ which 

implies rf p Rl ^ = Va,^- Because R'a is the identity map, 

(tt' x R')*(r)' aM ) = (tt' x x a m p^rfX,^)) 

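Let $X'$ be the approximate process constructed from $\eta'$. Then

$$\begin{aligned}
h(X'|\beta \circ X') &= \frac{1}{m'}\int H(\eta'_{\sigma,\beta\psi})\, d\eta'(\sigma, \psi) = \frac{1}{m'}\int H(\eta''_{\sigma'',\beta\psi''})\, d\eta''(\sigma'', \psi'') = \frac{m''}{m'}\, h(X''|\beta \circ X'') \\
&\ge (1 - \epsilon)h(X|\beta \circ X) - (1 - \epsilon)\epsilon\log|A|.
\end{aligned}$$

Let $\chi : \mathrm{Sym}(m)^G \times A^{[m]} \times [m] \to A$ be the map $\chi(\sigma, \psi, p) = \psi(p)$. Define $\chi'' : \mathrm{Sym}(m'')^G \times A^{[m'']} \times [m''] \to A$ similarly. Then

$$d_W(X, X'') = \tfrac{1}{2}\big\|\chi^W_*(\eta \times u) - (\chi'')^W_*(\eta'' \times u'')\big\|_1$$

where $u, u''$ denote the uniform probability measures on $[m]$, $[m'']$ respectively. So if $\Theta$ is any probability measure on $\mathrm{Sym}(m)^G \times A^{[m]} \times [m] \times \mathrm{Sym}(m'')^G \times A^{[m'']} \times [m'']$ with marginals $\eta \times u$ and $\eta'' \times u''$ then $d_W(X, X'') \le \Theta(S)$ where $S = \{(\sigma, \psi, p, \sigma'', \psi'', p'') : \chi^W(\sigma, \psi, p) \ne (\chi'')^W(\sigma'', \psi'', p'')\}$.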
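Let $\Theta''$ be the pushforward of $\eta \times u''$ under the map $(\sigma, \psi, p) \mapsto (\sigma, \psi, p, \pi\sigma, R\psi, p)$. Let $u_0$ be the uniform probability measure on $[m] \setminus [m'']$. Then

$$\eta \times u = \Big(\frac{m - m''}{m}\Big)\eta \times u_0 + \Big(\frac{m''}{m}\Big)\eta \times u''.$$

So if $\Theta_0 = \eta \times u_0 \times \eta'' \times u''$ and

$$\Theta := \Big(\frac{m - m''}{m}\Big)\Theta_0 + \Big(\frac{m''}{m}\Big)\Theta''$$

then $\Theta$ has marginals $\eta \times u$ and $\eta'' \times u''$. Moreover, $\Theta(S) \le \frac{m - m''}{m} + \frac{m''}{m}\Theta''(S)$ and, we claim, $\Theta''(S) \le |W|\epsilon$. This is because if U is the set of all $(\sigma, \psi, p, \pi\sigma, R\psi, p)$ such that $\sigma(g)p \in [m'']$ for all $g \in W$ then $U \cap S = \emptyset$ and $\Theta''(U) \ge 1 - |W|\frac{m - m''}{m''}$. Thus

$$d_W(X, X'') \le \Theta(S) \le \frac{m - m''}{m} + \frac{m''}{m}\Theta''(S) \le \frac{m - m''}{m} + \frac{m''}{m}\Big(\frac{m - m''}{m''}\Big)|W| \le 2\epsilon|W|.$$

Similarly, $d_W(X', X'') \le 2\epsilon|W|$. So $d_W(X, X') \le 4\epsilon|W|$.

By applying the same argument to the Dirac measure concentrated on an arbitrary $(\sigma, \psi) \in \mathrm{Sym}(m)^G \times A^{[m]}$ we obtain

$$d_W((\sigma, \psi), (\pi\sigma, R\psi)) \le 2\epsilon|W|.$$

A similar statement holds with $(\sigma', \psi') \in \mathrm{Sym}(m')^G \times A^{[m']}$. So,

$$\eta'(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) \ge \eta(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}).$$

This finishes Case 1.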

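Case 2. This is the general case. Let $\kappa_1, \kappa'_1$ be the marginals of $\vartheta$ restricted to $\mathcal{G}(W, \epsilon)$ and normalized to have total mass 1. Let $\eta_1 = \int \eta_\sigma\, d\kappa_1(\sigma)$. So $\eta_1$ is absolutely continuous to $\eta$. Let $X_1$ be the approximate process constructed from $\eta_1$.

Observe that

$$\begin{aligned}
h(X_1|\beta \circ X_1) &= \frac{1}{m}\int H(\eta_{\sigma,\beta\psi})\, d\eta_1(\sigma, \psi) \\
&= \frac{1}{m\,\vartheta(\mathcal{G}(W, \epsilon))}\int_{\mathcal{G}(W,\epsilon)}\int H(\eta_{\sigma,\beta\psi})\, d\eta_\sigma(\psi)\, d\vartheta(\sigma, \sigma') \\
&\ge -\epsilon\log|A| + \frac{1}{m}\int\!\!\int H(\eta_{\sigma,\beta\psi})\, d\eta_\sigma(\psi)\, d\vartheta(\sigma, \sigma') \\
&= -\epsilon\log|A| + h(X|\beta \circ X).
\end{aligned}$$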
It is easy to check that $d_W(X_1, X) \le \epsilon$ and

$$\eta_1(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}) \ge \eta(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}) - \epsilon.$$

By Case 1, there exists a measure $\eta'_1$ on $\mathrm{Sym}(m')^G \times A^{[m']}$ such that $\eta'_1$ projects to $\kappa'_1$ and if $X'_1$ is the approximate process constructed from $\eta'_1$ then $d_W(X'_1, X_1) \le 4\epsilon|W|$, $h(X'_1|\beta \circ X'_1) \ge (1 - \epsilon)h(X_1|\beta \circ X_1) - (1 - \epsilon)\epsilon\log|A|$ and

$$\eta'_1(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) \ge \eta_1(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}).$$

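Of course, $\kappa'_1$ is absolutely continuous with respect to $\kappa'$. Let $\eta'$ be any probability measure on $\mathrm{Sym}(m')^G \times A^{[m']}$ such that $\eta'$ projects to $\kappa'$ and

$$\eta'_1 = \int \eta'_\sigma\, d\kappa'_1(\sigma).$$

Let $X'$ be the approximate process constructed from $\eta'$. Because $\kappa' \ge (1 - \epsilon)\kappa'_1$ it follows that $\eta' \ge (1 - \epsilon)\eta'_1$. So

$$\begin{aligned}
h(X'|\beta \circ X') &= \frac{1}{m'}\int H(\eta'_{\sigma,\beta\psi})\, d\eta'(\sigma, \psi) \ge (1 - \epsilon)\frac{1}{m'}\int H(\eta'_{\sigma,\beta\psi})\, d\eta'_1(\sigma, \psi) \\
&= (1 - \epsilon)h(X'_1|\beta \circ X'_1) \ge (1 - \epsilon)^2 h(X_1|\beta \circ X_1) - (1 - \epsilon)^2\epsilon\log|A| \\
&\ge (1 - \epsilon)^2 h(X|\beta \circ X) - 2(1 - \epsilon)^2\epsilon\log|A|.
\end{aligned}$$

Of course, we also have $d_W(X'_1, X') \le \epsilon$ and because $\eta' \ge (1 - \epsilon)\eta'_1$,

$$\eta'(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) \ge (1 - \epsilon)\,\eta'_1(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}).$$

Thus

$$d_W(X, X') \le d_W(X, X_1) + d_W(X_1, X'_1) + d_W(X'_1, X') \le 6\epsilon|W|$$

and

$$\begin{aligned}
\eta'(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) &\ge (1 - \epsilon)\,\eta'_1(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) \\
&\ge (1 - \epsilon)\,\eta_1(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}) \\
&\ge (1 - \epsilon)\,\eta(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}) - \epsilon.
\end{aligned}$$

□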

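Proof of Theorem 5.5. Let $\{X_j\}_{j=1}^\infty$ be a sequence of approximate processes adapted to $\mathbb{K}'$ (where $\mathbb{K}'$ is a subsequence of $\mathbb{K}$) such that $\lim_{j\to\infty} X_j = X$. Let $\eta_j$ be the probability measure on $\mathrm{Sym}(m_j)^G \times A^{[m_j]}$ from which $X_j$ is constructed.

Let $\epsilon > 0$ and $W \subset G$ be finite. Let $\mathbb{L} = \{\lambda_j\}_{j=1}^\infty$. By the previous lemma, for all sufficiently large j there exists an approximate process $Y_j$ adapted to $\lambda_j$ such that $h(Y_j|\beta \circ Y_j) \ge (1 - \epsilon)^2 h(X_j|\beta \circ X_j) - 2\epsilon\log|A|$, $d_W(Y_j, X_j) \le 6|W|\epsilon$ and if $\eta'_j$ is the measure on $\mathrm{Sym}(m'_j)^G \times A^{[m'_j]}$ from which $Y_j$ is constructed then

$$\eta'_j(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < (4|W| + 1)\epsilon\}) \ge (1 - \epsilon)\,\eta_j(\{(\sigma, \psi) : d_W((\sigma, \psi), \phi) < \epsilon\}) - \epsilon.$$

So if $\{W_n\}_{n=1}^\infty$ is an increasing sequence of finite subsets of G then for every n there is a $J(n) > 0$ so that for every $j > J(n)$ there is an approximate process $Y_j$ adapted to $\lambda_j$ such that $h(Y_j|\beta \circ Y_j) \ge (1 - \frac{1}{n})h(X_j|\beta \circ X_j) - \frac{1}{n}$, $d_{W_n}(Y_j, X_j) < \frac{1}{n}$ and, if $\lim_{j\to\infty} X_j = X$ strongly then

$$\eta'_j\big(\{(\sigma, \psi) : d_{W_n}((\sigma, \psi), \phi) < \tfrac{1}{n}\}\big) > 1 - \tfrac{1}{n}.$$

Without loss of generality we may assume $\{J(n)\}_{n=1}^\infty$ is an increasing sequence. So $\lim_{n\to\infty} Y_{J(n)} = X$, $\{Y_{J(n)}\}_{n=1}^\infty$ is adapted to a subsequence of $\mathbb{L}$ and $\limsup_{n\to\infty} h(Y_{J(n)}|\beta \circ Y_{J(n)}) \ge \limsup_{n\to\infty} h(X_{J(n)}|\beta \circ X_{J(n)})$. So $\overline{h}(\mathbb{L}, X|\beta \circ X) \ge \overline{h}(\mathbb{K}, X|\beta \circ X)$. By symmetry we must in fact have $\overline{h}(\mathbb{L}, X|\beta \circ X) = \overline{h}(\mathbb{K}, X|\beta \circ X)$ as claimed. Moreover, if $\lim_{j\to\infty} X_j = X$ strongly then $\lim_{n\to\infty} Y_{J(n)} = X$ strongly. So $\underline{h}(\mathbb{L}, X|\beta \circ X) = \underline{h}(\mathbb{K}, X|\beta \circ X)$. □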

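Next we extend Proposition 4.1 to the relative case:

Proposition 5.8. Let $\mathbb{K} = \{\kappa_i\}_{i=1}^\infty$ be a random sofic approximation of $\mathbb{Z}$. Let X be a process over $\mathbb{Z}$ with range A and let $\beta : A \to B$ be a map. Then $\overline{h}(\mathbb{K}, X|\beta \circ X) = \underline{h}(\mathbb{K}, X|\beta \circ X) = h(X|\beta \circ X)$.

Proof. The inequalities

$$\overline{h}(\mathbb{K}, X|\beta \circ X) \ge \underline{h}(\mathbb{K}, X|\beta \circ X) \ge h(X|\beta \circ X)$$

follow immediately from the definitions and Proposition 4.1. So it suffices to show $h(X|\beta \circ X) \ge \overline{h}(\mathbb{K}, X|\beta \circ X)$. By Theorem 5.5 and Theorem 4.3 we may assume $\mathbb{K}$ is the non-random sofic approximation $\Sigma = \{\sigma_i\}_{i=1}^\infty$ where $\sigma_i : \mathbb{Z} \to \mathrm{Sym}(m_i)$ is the homomorphism with $\sigma_i(1) = (1, 2, \ldots, m_i)$.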
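
To make the cyclic approximation concrete, here is a small sketch that checks the two conditions of a sofic approximation for the rotation homomorphism. It uses the index set $\{0, \ldots, m-1\}$ rather than $[m] = \{1, \ldots, m\}$, and the function names are illustrative assumptions rather than notation from the paper.

```python
def sigma(m, n):
    """Image of the integer n under the rotation homomorphism: p -> p + n (mod m)."""
    return tuple((p + n) % m for p in range(m))

def multiplicative_fraction(m, g, h):
    """Fraction of points p with sigma(g) sigma(h) p == sigma(g + h) p."""
    sg, sh, sgh = sigma(m, g), sigma(m, h), sigma(m, g + h)
    return sum(1 for p in range(m) if sg[sh[p]] == sgh[p]) / m

def separating_fraction(m, g, h):
    """Fraction of points p with sigma(g) p != sigma(h) p."""
    sg, sh = sigma(m, g), sigma(m, h)
    return sum(1 for p in range(m) if sg[p] != sh[p]) / m

# sigma is a genuine homomorphism, so the first fraction is always 1.
print(multiplicative_fraction(7, 3, -5))
# Distinct integers g, h are separated at every point once m > |g - h|.
print(separating_fraction(7, 1, 8), separating_fraction(9, 1, 8))
```

For fixed $g \ne h$ the separating fraction is 0 or 1 for each $m$, and it equals 1 for all sufficiently large $m$, which is why the requirement $m_i \to \infty$ suffices here.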

Let N be a large positive integer. If $m'_i$ is the integer nearest to $m_i$ that is divisible by N and $\sigma'_i : \mathbb{Z} \to \mathrm{Sym}(m'_i)$ is the homomorphism with $\sigma'_i(1) = (1, 2, \ldots, m'_i)$ then $\{\sigma'_i\}_{i=1}^\infty$ is asymptotic to $\{\sigma_i\}_{i=1}^\infty$. So without loss of generality, we may assume N divides $m_i$ for each i.

Let $\{X_i\}_{i=1}^\infty$ be a sequence of approximate processes constructed from measures $\eta_i$ on $\mathrm{Sym}(m_i)^{\mathbb{Z}} \times A^{[m_i]}$ (as in Definition 6) adapted to $\mathbb{K}'$, a subsequence of $\mathbb{K}$, such that $\lim_{i\to\infty} X_i = X$ and $\lim_{i\to\infty} h(X_i) - h(\beta \circ X_i) = \overline{h}(\mathbb{K}, X|\beta \circ X)$. Without loss of generality, we may assume $\mathbb{K}' = \mathbb{K}$. So $\eta_i = \delta_i \times \lambda_i$ for some measure $\lambda_i$ on $A^{[m_i]}$ where $\delta_i$ is the probability measure concentrated on $\{\sigma_i\}$.
Fix i for now. For $a, b \in [m_i]$ let $a + b \in [m_i]$ denote their sum modulo $m_i$. Also let $[a, b]$ be the interval from a to b: $[a, b] = \{a, a + 1, a + 2, \ldots, b\}$. For example, $[m_i - 1, 1] = \{m_i - 1, m_i, 1\}$. Let $\pi^{[a,b]} : A^{[m_i]} \to A^{[a,b]}$ be the projection map and $\lambda_i^{[a,b]} = \pi^{[a,b]}_*\lambda_i$ the pushforward measure. For $N > 0$ a positive integer, let $\lambda'_{i,N}$ be the product measure

$$\lambda'_{i,N} := \lambda_i^{[1,N]} \times \lambda_i^{[N+1,2N]} \times \cdots \times \lambda_i^{[m_i-N+1,m_i]}.$$

Let $\beta^{[a,b]} : A^{[a,b]} \to B^{[a,b]}$ denote the map $\beta^{[a,b]}(\xi) = \beta \circ \xi$. In order to simplify notation, we write $\beta = \beta^{[a,b]}$ when $[a, b]$ is clear from the context. We claim that

if (AW - H0, Kn ) £ HM-mw = _ 



by 



For £ G 5 [mi] and C G £ M we let A<(-|0 and A<(-|C) be the measure defined for E C 



Other conditional measures such as /3*Aj(-|C) are defined similarly. 

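Let $\lambda''_{i,N}$ be the measure on $A^{[m_i]}$ satisfying $\beta_*\lambda''_{i,N} = \beta_*\lambda_i$ whose fiber over $\xi \in B^{[m_i]}$ is the measure

$$\lambda''_{i,N}(\cdot|\xi) = \pi^{[1,N]}_*\lambda_i(\cdot|\xi) \times \pi^{[N+1,2N]}_*\lambda_i(\cdot|\xi) \times \cdots \times \pi^{[m_i-N+1,m_i]}_*\lambda_i(\cdot|\xi).$$

Since $\beta_*\lambda''_{i,N} = \beta_*\lambda_i$ and $H(\lambda''_{i,N}(\cdot|\xi)) \ge H(\lambda_i(\cdot|\xi))$ for any $\xi$ we have

$$H(\lambda''_{i,N}) - H(\beta_*\lambda''_{i,N}) \ge H(\lambda_i) - H(\beta_*\lambda_i).$$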



Note 



, rrn/N-l 



m.i/N-1 

= E H(^ N+1 ' jN+N] ^(-\0) dPMt\CW*\r +1 ' jN+N] (0 

3=0 J J 

rrn/N-l 

< Yl H(^ N+1,jN+N] h(-\o) d?x N+i ' jN+N] (o 

3=0 J 

= H(X[ n )-H(/3X,n)- 

The inequality above holds by concavity of entropy. This proves the inequality in (3). The equality in (3) holds by definition.

For each p G {0, . . . , — 1} let A^ p iV be the product measure 

\/ i\p+la>+N] *\p+N+l,p+2N] x [p+m<-iVH-l,p+m i ] 

A i,p,Af •- A i x A i X • • • X Aj 
An argument similar to the one proving (3) shows

$$H(\lambda'_{i,p,N}) - H(\beta_*\lambda'_{i,p,N}) \ge H(\lambda_i) - H(\beta_*\lambda_i). \qquad (4)$$

For a,b G [rrii ], define : At 1 ' 6 -"* 1 ] by T^(£)(p) = £(p + a- 1). Let X [ * M be 

the measure defined for sets E C A^' b - a+1 ^ by 

Ai a,fe] (E) := A| a ' 61 (r M £)- 

For C G St 1 ' 6 -"* 1 ] let A^' &] (- |C) be the measure defined for sets E C Al 1 ' 6 ""^ by 

Af'^K) := Ai^ ] (r M £k M C) 
where we have abused notation by letting r' a '^ denote the analogous map from B^ 1,b ^ a+1 ^ — > 

g[a,b] 

Because #(A| p+1 ' p+iV+1] (-|C)) = H(Xf +1 ' p+N+1] (-\r^+ l <P+ N +^C), 



m, P , N ) - ffouu) = E / H(xr +p+1 ' u+1)N+p K-\o) d^xr^^Kc). 

j=0 J 

By©, 

H(Xj) -H(f3*Xj) 1 rr/W \ TT(R\' \ 

h '' m i p=0 

r rij- l „ 

^- E / ^ar^-io) ^AS p+1 - +iV] (c). 

2 r>=f) 



By concavity of entropy, if for £ e £?[ 1,JV 1 3 u;^ is the measure on A^ 1,N ^ defined by 

_ z;r 1 ^ +1 ' p+N \{c})-^ +l ' p+N] (-\c) 

then 

m,,— 1 „ m,i — l „ 

E / ii(\ b+i ' p+jv] (-io) ^Ar +i,p+iv] (c) < e / ^Kc) ^Ar +i,p+jv] (c). 

p=0 ^ p=0 ^ 

Therefore, 

H(X t ) - F(^A0 < J_ y / # ( ) ^At +1 ' p+iV] (C). 
So if up t i is the measure on $B^{[1,N]}$ defined by



up 
then 

H(Xi) - H(/3,Xi) ^ 1 



rrii - 1 

\p+l,p+N] 



- E ^ 

rn ■ 



rrii 



Let X = (T, X, n,<fi). By construction, u^ converges (as i — > oo) to the measure /i^ 
defined for E C At 1 ^ by 

M h fl({xeX: ^o<PY{x) = Q) 

Also, up^ converges Therefore 

i— >oo TTli J iv iV 

The right hand side converges to $h(X|\beta \circ X)$ as N tends to infinity. This proves the proposition.

□ 



6 Orbit equivalence and entropy 

In this section, we prove Theorem 1.2 by generalizing a theorem of Rudolph and Weiss which is explained next.

Definition 19. Let G, $\Gamma$ be countable discrete groups and let $(X, \mathcal{B}, \mu)$ be a standard probability space. Let $G \curvearrowright^T (X, \mathcal{B}, \mu)$ and $\Gamma \curvearrowright^S (X, \mathcal{B}, \mu)$ be two probability measure preserving actions with the same orbits. We assume that both actions are essentially free. Let $\rho : \Gamma \times X \to G$ be the cocycle

$$\rho(\gamma, x) := g \iff T_g(x) = S_\gamma(x).$$

If $\mathcal{A} \subset \mathcal{B}$ is a sub-$\sigma$-algebra such that $\rho(\gamma, \cdot)$ is $\mathcal{A}$-measurable for all $\gamma \in \Gamma$ then the orbit change from T to S is said to be $\mathcal{A}$-measurable. The smallest such $\sigma$-algebra is called the orbit change $\sigma$-algebra.

The next theorem is proven in [RW00].
Theorem 6.1. Suppose T is an essentially free ergodic action of a countable discrete amenable group G and $\mathcal{A}$ is a T-invariant sub-$\sigma$-algebra. Suppose also that S is an essentially free action of $\Gamma$ with the same orbits as T (this implies $\Gamma$ is amenable and S is ergodic). Suppose the orbit change from T to S is $\mathcal{A}$-measurable. Then for any finite observable $\phi : X \to A$ we conclude

$$h(T, \phi|\mathcal{A}) = h(S, \phi|\mathcal{A}).$$

The rest of the paper is devoted to proving a related result: 

Proposition 6.2. Let G be an amenable group, $X = (T, X, \mu, \phi)$ an essentially free G-process with finite range A and $S : (X, \mu) \to (X, \mu)$ be an essentially free measure-preserving Borel automorphism with the same orbits as T (i.e., for $\mu$-a.e. $x \in X$, $\{T_g x : g \in G\} = \{S^n x : n \in \mathbb{Z}\}$).

Let $\beta : A \to B$ be a map and suppose the orbit change from T to S is measurable with respect to both the T-invariant sub-sigma-algebra generated by $\psi := \beta \circ \phi$ and the S-invariant sub-sigma-algebra generated by $\psi$. Then for any random sofic approximation $\mathbb{K}$ to G,

$$\overline{h}(\mathbb{K}, X|\beta \circ X) = \underline{h}(\mathbb{K}, X|\beta \circ X) = h(S, \phi|\beta \circ \phi).$$

Given the proposition above, we prove:

Theorem 6.3. Let G be a countably infinite amenable group with random sofic approximation $\mathbb{K}$. Let $X = (T, X, \mu, \phi)$ be a G-process. Then

$$\overline{h}(\mathbb{K}, X) = \underline{h}(\mathbb{K}, X) = h(X).$$

Of course, this implies Theorem 1.2.

Proof. We will prove the statement for lower-sofic entropy only; the upper-sofic entropy case is similar. Let $Y = (S, Y, \nu, \psi')$ be a Bernoulli process over G with base $(B', \omega)$ where $B'$ is a finite set and $\omega$ is not supported on a singleton. This process is weakly mixing and $\psi' : Y \to B'$ is generating. By [Dy59, Dy63, CFW81], any ergodic essentially free probability measure preserving action of $\mathbb{Z}$ is orbit-equivalent to $(S, Y, \nu)$. So there exists a weakly mixing automorphism $U : (Y, \nu) \to (Y, \nu)$ with the same orbits as S and a Borel map $\psi'' : Y \to B''$ to a finite set $B''$ which generates in the sense that the smallest U-invariant sigma-algebra on which $\psi''$ is measurable is the Borel sigma-algebra of $(Y, \nu)$ (up to measure zero sets).

Let $\psi : Y \to B = B' \times B''$ be the map $\psi(y) = (\psi'(y), \psi''(y))$. Note that the Borel sigma-algebra of $(Y, \nu)$ is the smallest sigma-algebra generated by $\psi$ and the G-action (modulo measure zero sets), which is also the smallest sigma-algebra generated by $\psi$ and U (modulo measure zero sets).

Define $V : X \times Y \to X \times Y$ by

$$V(x, y) = (T_g x, S_g y) \iff Uy = S_g y.$$
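Note that the orbit change from $T \times S$ to V is measurable with respect to both the G-invariant sub-$\sigma$-algebra generated by $\psi$ and the V-invariant sub-$\sigma$-algebra generated by $\psi$. Proposition 6.2 and Lemma 5.4 imply

$$\underline{h}_\mu(\mathbb{K}, \phi) = \underline{h}_{\mu\times\nu}(\mathbb{K}, \phi \times \psi|\psi) = h_{\mu\times\nu}(V, \phi \times \psi|\psi). \qquad (5)$$

Let $\mu = \int \lambda\, d\zeta(\lambda)$ be the ergodic decomposition of $\mu$. Because Y is weakly mixing,

$$\mu \times \nu = \int \lambda \times \nu\, d\zeta(\lambda)$$

is the ergodic decomposition of $\mu \times \nu$. It is well-known that the classical entropy of a process equals the integral of the entropies of its ergodic components. So Theorem 6.1 implies

$$\begin{aligned}
h_{\mu\times\nu}(V, \phi \times \psi|\psi) &= \int h_{\lambda\times\nu}(V, \phi \times \psi|\psi)\, d\zeta(\lambda) \\
&= \int h_{\lambda\times\nu}(T \times S, \phi \times \psi|\psi)\, d\zeta(\lambda) \\
&= h_{\mu\times\nu}(T \times S, \phi \times \psi|\psi) = h_\mu(T, \phi).
\end{aligned}$$

So (5) implies $\underline{h}_\mu(\mathbb{K}, \phi) = h_\mu(T, \phi)$ as required. □

6.1 Lifting factors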

Proposition 6.2 is proven by "lifting" the orbit-equivalence to sofic approximations. But first, we "lift" factors that do not necessarily come from composing with a map $\beta : A \to B$.

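Definition 20. Let $\{\mathbf{X}_i\}_{i=1}^\infty$ be a sequence of approximate processes over $G$ constructed from a sequence $\{\eta_i\}_{i=1}^\infty$ of probability measures on $\mathrm{Sym}(m_i)^G \times A^{[m_i]}$. If $W \subset G$ is finite, $\sigma \in \mathrm{Sym}(m_i)^G$ and $\xi \in A^{[m_i]}$, then let $\xi[\sigma, W] \in (A^W)^{[m_i]}$ be defined by

$$\xi[\sigma, W](p)(w) := \xi(\sigma(w)p).$$

Let $\eta_i^W$ be the measure on $\mathrm{Sym}(m_i)^G \times (A^W)^{[m_i]}$ obtained by pushing forward $\eta_i$ under the map

$$(\sigma, \xi) \in \mathrm{Sym}(m_i)^G \times A^{[m_i]} \mapsto (\sigma, \xi[\sigma, W]).$$

Let $\{\mathbf{X}_i^W\}_{i=1}^\infty$ denote the sequence of approximate processes constructed from $\{\eta_i^W\}_{i=1}^\infty$.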
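
The lifted labelling of Definition 20 can be illustrated with a small sketch. The example below is a toy under stated assumptions: it takes $G = \mathbb{Z}$, uses the cyclic shifts on $\{0, \dots, m-1\}$ as a hypothetical stand-in for a single $\sigma \in \mathrm{Sym}(m)^G$ (points are indexed from 0 rather than from 1), and the names `cyclic_sigma` and `lift_labelling` are introduced here for illustration only.

```python
from typing import Callable, List, Sequence, Tuple


def cyclic_sigma(m: int) -> Callable[[int], Callable[[int], int]]:
    """Hypothetical sigma for G = Z: sigma(g) is the shift p -> p + g (mod m)."""
    def sigma(g: int) -> Callable[[int], int]:
        return lambda p: (p + g) % m
    return sigma


def lift_labelling(sigma, xi: Sequence[str], W: Sequence[int]) -> List[Tuple[str, ...]]:
    """Return xi[sigma, W]: the point p is labelled by (xi(sigma(w) p))_{w in W}."""
    return [tuple(xi[sigma(w)(p)] for w in W) for p in range(len(xi))]


m = 6
xi = ["a", "b", "a", "a", "b", "b"]   # a labelling in A^[m] with A = {a, b}
W = [0, 1, 2]                          # a finite window in G = Z
lifted = lift_labelling(cyclic_sigma(m), xi, W)

for p, pattern in enumerate(lifted):
    print(p, pattern)
```

Each point $p$ now carries the whole $W$-pattern seen from $p$, which is the sense in which $\xi[\sigma, W]$ takes values in $A^W$.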

The next lemma is immediate. 

Lemma 6.4. If, in the definition above, $\lim_{i\to\infty} \mathbf{X}_i = \mathbf{X} = (T, X, \mu, \phi)$ then $\lim_{i\to\infty} \mathbf{X}_i^W = (T, X, \mu, \phi^W)$.

Assume that $\mathbf{X} = (T, X, \mu, \phi)$ is a process. Let $\psi : X \to B$ be a measurable map into a finite or countable set $B$. For each finite $W \subset G$, let $\psi_W : A^W \to B$ be a measurable function satisfying

$$\psi_W(\xi) = b \implies \mu(\{x \in X : \psi(x) = b \text{ and } \phi^W(x) = \xi\}) \ge \mu(\{x \in X : \psi(x) = c \text{ and } \phi^W(x) = \xi\}) \quad \forall c \in B.$$

Warning: do not confuse $\psi_W : A^W \to B$ with $\psi^W : X \to B^W$.
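
The condition defining $\psi_W$ is a plurality rule: each pattern $\xi \in A^W$ is sent to a label $b$ that is most likely to occur together with $\xi$. A minimal sketch, assuming the joint masses $\mu(\{\psi = b,\ \phi^W = \xi\})$ are given as a finite table (the table `joint` and the function name `plurality_map` are hypothetical, and ties are broken by taking the smallest label):

```python
from typing import Dict, Hashable, Tuple


def plurality_map(joint: Dict[Tuple[Hashable, Hashable], float]) -> Dict[Hashable, Hashable]:
    """Map each pattern to a label of maximal joint mass (ties: smallest label wins)."""
    best: Dict[Hashable, Tuple[Hashable, float]] = {}
    # Sorting the keys makes the tie-breaking deterministic.
    for (pattern, label), mass in sorted(joint.items()):
        if pattern not in best or mass > best[pattern][1]:
            best[pattern] = (label, mass)
    return {pattern: label for pattern, (label, _) in best.items()}


# Hypothetical joint masses mu({psi = b and phi^W = pattern}); they sum to 1.
joint = {
    (("a", "a"), 0): 0.30, (("a", "a"), 1): 0.10,
    (("a", "b"), 0): 0.05, (("a", "b"), 1): 0.25,
    (("b", "a"), 0): 0.15, (("b", "a"), 1): 0.15,
}
psi_W = plurality_map(joint)
print(psi_W)
```

Any tie-breaking rule satisfies the displayed inequality; the choice made here is only for reproducibility.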

Definition 21. Let $\Lambda$ be a function on the set of finite subsets of $G$. We write $\lim_{W\to G} \Lambda(W) = L$ if for every increasing sequence $\{W_j\}_{j=1}^\infty$ of finite subsets of $G$ with $\bigcup_j W_j = G$, $\lim_{j\to\infty} \Lambda(W_j) = L$.

Lemma 6.5. Let $\mathbf{X} = (T, X, \mu, \phi)$ be a $G$-process such that $\phi : X \to A$ is generating. Let $\{\mathbf{X}_i\}_{i=1}^\infty$ be a sequence of approximate processes constructed from a sequence $\{\eta_i\}_{i=1}^\infty$ of probability measures on $\mathrm{Sym}(m_i)^G \times A^{[m_i]}$. Suppose that $\lim_{i\to\infty} \mathbf{X}_i = \mathbf{X}$. Then

$$\lim_{i\to\infty} \psi_W \circ \mathbf{X}_i^W = (T, X, \mu, \psi_W \circ \phi^W),$$

$$\lim_{W\to G} (T, X, \mu, \psi_W \circ \phi^W) = (T, X, \mu, \psi).$$

Proof. The first limit follows from the previous lemma and Lemma 5.1. The second limit is a consequence of the fact that $\phi$ is generating. □



6.2 Lifting orbit-equivalences 

The concepts of the previous subsection are used to 'lift' orbit-equivalences as follows. Let $\mathbf{X} = (T, X, \mu, \phi)$ be a process over $G$ with range $A$. Suppose $\Gamma$ is a (possibly different) group and $\mathbf{Y} = (S, X, \mu, \phi)$ is a process over $\Gamma$ with the same orbits as $G$ (up to $\mu$-measure zero). Suppose there is a map $\beta : A \to B$ such that the orbit change from $T$ to $S$ is measurable with respect to the $G$-invariant sub-$\sigma$-algebra generated by $\psi := \beta\circ\phi$ and that both $\mathbf{X}$ and $\mathbf{Y}$ are essentially free (e.g., for a.e. $x \in X$, $T_g x \ne T_h x$ if $g \ne h$). Define the cocycle $\rho : \Gamma \times X \to G$ by

$$\rho(\gamma, x) = g \iff S_\gamma x = T_g x.$$
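
The cocycle can be made concrete on a finite toy model. The sketch below is an illustration under assumptions that differ from the setting of the text: both groups are taken to be $\mathbb{Z}/7$ (written additively), the space is the 7-point set with a single orbit, $T$ acts by translation and $S$ by powers of a hypothetical 7-cycle `succ`. It checks that $\rho$ is well defined and satisfies the cocycle identity $\rho(\gamma_1\gamma_2, x) = \rho(\gamma_1, S_{\gamma_2}x)\,\rho(\gamma_2, x)$ used below.

```python
n = 7  # both toy groups are Z/n, written additively


def T(g: int, x: int) -> int:
    """Free action of G = Z/n on {0, ..., n-1} by translation."""
    return (x + g) % n


# A hypothetical n-cycle: S_1 moves each point to the next one in this list.
cycle = [0, 3, 1, 6, 2, 5, 4]
succ = {cycle[k]: cycle[(k + 1) % n] for k in range(n)}


def S(gamma: int, x: int) -> int:
    """Free action of Gamma = Z/n by powers of the n-cycle; same orbit as T."""
    for _ in range(gamma % n):
        x = succ[x]
    return x


def rho(gamma: int, x: int) -> int:
    """The unique g in G with S_gamma x = T_g x (unique because T is free)."""
    return (S(gamma, x) - x) % n


# rho really intertwines the two actions.
intertwines = all(T(rho(c, x), x) == S(c, x) for c in range(n) for x in range(n))

# Cocycle identity, with the group law of G written as addition mod n.
cocycle_ok = all(
    rho((c1 + c2) % n, x) == (rho(c1, S(c2, x)) + rho(c2, x)) % n
    for c1 in range(n) for c2 in range(n) for x in range(n)
)
print(intertwines, cocycle_ok)
```

Here $\rho(\gamma, x)$ genuinely depends on $x$, which is the feature the maps $\rho_W$ of the next lemma have to approximate from finitely many coordinates.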

Lemma 6.6. For each finite set $W \subset G$ there exists a map $\rho_W : \Gamma \times B^W \to G$ such that for every $\gamma \in \Gamma$ and $g \in G$,

$$\lim_{W\to G} \mu\big(\{x \in X : \rho(\gamma, x) = g\} \,\Delta\, \{x \in X : \rho_W(\gamma, \psi^W(x)) = g\}\big) = 0.$$

Proof. For each $\gamma \in \Gamma$ and $g \in G$ let

$$X(\gamma, g) = \{x \in X : \rho(\gamma, x) = g\}.$$

Choose orderings $G = \{g_1, g_2, \ldots\}$, $\Gamma = \{\gamma_1, \gamma_2, \ldots\}$ of $G$ and $\Gamma$. For each $n$, let $P_n$ be the smallest partition of $X$ containing $X(\gamma_i, g_j)$ for all $1 \le i, j \le n$. Because the orbit change is measurable with respect to the smallest $G$-invariant sigma-algebra $\Sigma$ on which $\psi$ is measurable, the partitions $P_n$ are contained in $\Sigma$ (up to measure zero sets). Therefore for each $n$ there are a finite set $W_n \subset G$ and sets $X_n(\gamma_i, g_j)$ for $1 \le i, j \le n$ such that

1. $X_n(\gamma_i, g_j)$ is contained in the smallest sigma-algebra on which $\psi^{W_n}$ is measurable;

2. $X_n(\gamma_i, g_j) \cap X_n(\gamma_i, g_k) = \emptyset$ if $j \ne k$ (and $1 \le i, j, k \le n$);

3. $\mu(X(\gamma_i, g_j) \,\Delta\, X_n(\gamma_i, g_j)) < 2^{-n}$ for all $1 \le i, j \le n$.

Let $\rho_{W_n} : \Gamma \times B^{W_n} \to G$ be a map such that $\rho_{W_n}(\gamma_i, \psi^{W_n}(x)) = g_j$ if $1 \le i, j \le n$ and $x \in X_n(\gamma_i, g_j)$.

For any finite $W \subset G$, if there is an $i \ge 1$ such that $W_i \subset W$ then choose a maximal such $i$ and let $\rho_W := \rho_{W_i}$. Otherwise, define $\rho_W$ arbitrarily. The lemma follows immediately.

□

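Lemma 6.7 (Asymptotic cocycle identity). For any $\gamma_1, \gamma_2 \in \Gamma$,

$$\lim_{W\to G} \psi^W_*\mu\big(\{\xi \in B^W : \rho_W(\gamma_1\gamma_2, \xi) = \rho_V(\gamma_1, \xi')\,\rho_W(\gamma_2, \xi)\}\big) = 1$$

where $g_2 = \rho_W(\gamma_2, \xi)$, $V = W \cap W g_2^{-1}$, $\xi' = \psi^V(T_{g_2}x)$ and $x \in X$ is any element with $\psi^W(x) = \xi$.

Proof. The claim of the lemma is equivalent to

$$\lim_{W\to G} \mu\big(\{x \in X : \rho_W(\gamma_1\gamma_2, \psi^W(x)) = \rho_V(\gamma_1, \psi^V(T_{g_2}x))\,\rho_W(\gamma_2, \psi^W(x))\}\big) = 1$$

where $V, g_2$ are as defined above. For $\gamma \in \Gamma$, $g \in G$ define $X(\gamma, g) := \{x \in X : \rho(\gamma, x) = g\}$ as in the previous lemma. Also for $W \subset G$ finite define

$$X_W(\gamma, g) := \{x \in X : \rho_W(\gamma, \psi^W(x)) = g\}.$$

The cocycle identity $\rho(\gamma_1\gamma_2, x) = \rho(\gamma_1, S_{\gamma_2}x)\,\rho(\gamma_2, x)$ is equivalent to the statement:

$$X(\gamma_2, g_2) \cap T_{g_2}^{-1}X(\gamma_1, g_1) \subset X(\gamma_1\gamma_2, g_1g_2)$$

(mod 0) for every $g_1, g_2 \in G$. Let

$$Y(\gamma_1, \gamma_2, g_1, g_2) := X(\gamma_1\gamma_2, g_1g_2) \cap X(\gamma_2, g_2) \cap T_{g_2}^{-1}X(\gamma_1, g_1),$$
$$Y_W(\gamma_1, \gamma_2, g_1, g_2) := X_W(\gamma_1\gamma_2, g_1g_2) \cap X_W(\gamma_2, g_2) \cap T_{g_2}^{-1}X_V(\gamma_1, g_1).$$

Then $\{Y(\gamma_1, \gamma_2, g_1, g_2) : g_1, g_2 \in G\}$ partitions $X$ by the cocycle equation. Observe that

$$Y_W(\gamma_1, \gamma_2, g_1, g_2) \subset \{x \in X : \rho_W(\gamma_1\gamma_2, \psi^W(x)) = \rho_V(\gamma_1, \psi^V(T_{g_2}x))\,\rho_W(\gamma_2, \psi^W(x))\}.$$

So it suffices to prove that

$$\lim_{W\to G} \mu\Big(\bigcup_{g_1, g_2 \in G} Y_W(\gamma_1, \gamma_2, g_1, g_2)\Big) = 1.$$

If $V = W \cap W g_2^{-1}$ then as $W \to G$, $V \to G$. So the previous lemma implies

$$0 = \lim_{W\to G} \mu\big(X_W(\gamma_1\gamma_2, g_1g_2) \,\Delta\, X(\gamma_1\gamma_2, g_1g_2)\big)$$
$$= \lim_{W\to G} \mu\big(X_W(\gamma_2, g_2) \,\Delta\, X(\gamma_2, g_2)\big)$$
$$= \lim_{W\to G} \mu\big(T_{g_2}^{-1}X_V(\gamma_1, g_1) \,\Delta\, T_{g_2}^{-1}X(\gamma_1, g_1)\big)$$

which implies

$$0 = \lim_{W\to G} \mu\big(Y(\gamma_1, \gamma_2, g_1, g_2) \,\Delta\, Y_W(\gamma_1, \gamma_2, g_1, g_2)\big)$$

for any fixed $g_1, g_2 \in G$. Since $\{Y(\gamma_1, \gamma_2, g_1, g_2) : g_1, g_2 \in G\}$ partitions $X$, for every $\epsilon > 0$ there is a finite set $S \subset G$ such that

$$\mu\Big(\bigcup_{g_1, g_2 \in S} Y(\gamma_1, \gamma_2, g_1, g_2)\Big) > 1 - \epsilon.$$

Because

$$0 = \lim_{W\to G} \mu\Big(\bigcup_{g_1, g_2 \in S} Y(\gamma_1, \gamma_2, g_1, g_2) \,\Delta\, Y_W(\gamma_1, \gamma_2, g_1, g_2)\Big)$$

it follows that

$$\lim_{W\to G} \mu\Big(\bigcup_{g_1, g_2 \in G} Y_W(\gamma_1, \gamma_2, g_1, g_2)\Big) \ge 1 - \epsilon.$$

Since $\epsilon > 0$ is arbitrary, the lemma follows.

□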



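In general, if $Z$ is a topological space and $\{z_{W,i}\}_{W \subset G, i \in \mathbb{N}}$ is a collection of elements of $Z$ then we write $\lim_{(W,i)\to(G,\infty)} z_{W,i} = z$ if for every limit point $z_W$ of $\{z_{W,i}\}_{i=1}^\infty$, $\lim_{W\to G} z_W = z$.

Let $\{\mathbf{X}_{\beta,i}\}_{i=1}^\infty$ be a sequence of approximate processes constructed from measures $\{\eta_i\}_{i=1}^\infty$ (adapted to a random sofic approximation $\mathbb{K}$ to $G$) so that $\lim_{i\to\infty} \mathbf{X}_{\beta,i} = \beta\circ\mathbf{X}$. For each $(\sigma, \xi) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]}$ and $\gamma \in \Gamma$, define $S'_{W,\sigma,\xi}(\gamma) : [m_i] \to [m_i]$ by

$$S'_{W,\sigma,\xi}(\gamma)(p) = \sigma(g)p \iff \rho_W(\gamma, \xi[\sigma, W](p)) = g.$$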
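
Lemma 6.8. For any $\gamma_1, \gamma_2 \in \Gamma$,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : S'_{W,\sigma,\xi}(\gamma_1)S'_{W,\sigma,\xi}(\gamma_2)p = S'_{W,\sigma,\xi}(\gamma_1\gamma_2)p\}\big) = 1.$$

Also for any $g_2' \in G$,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : \rho_W(\gamma_2, \xi[\sigma, W](p)) = g_2'\}\big) = \mu(\{x \in X : \rho(\gamma_2, x) = g_2'\}).$$

Proof. For $g_1, g_2 \in G$, let

$$Y(g_1, g_2) := \{x \in X : \rho(\gamma_2, x) = g_2,\ \rho(\gamma_1, T_{g_2}x) = g_1\}.$$

Note that a.e. $x \in Y(g_1, g_2)$ satisfies $\rho(\gamma_1\gamma_2, x) = g_1g_2$ by the cocycle equation. For $i \ge 1$ and $W \subset G$ finite let $Y_i^W(g_1, g_2)$ be the set of all $(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i]$ such that

$$g_1g_2 = \rho_W(\gamma_1\gamma_2, \xi[\sigma, W](p)),$$
$$g_2 = \rho_W(\gamma_2, \xi[\sigma, W](p)),$$
$$g_1 = \rho_W(\gamma_1, \xi[\sigma, W](\sigma(g_2)p)),$$
$$\sigma(g_1g_2)p = \sigma(g_1)\sigma(g_2)p.$$

Note that if $(\sigma, \xi, p) \in Y_i^W(g_1, g_2)$ then $S'_{W,\sigma,\xi}(\gamma_1)S'_{W,\sigma,\xi}(\gamma_2)p = S'_{W,\sigma,\xi}(\gamma_1\gamma_2)p$. Therefore, it suffices to show that

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\Big(\bigcup\{Y_i^W(g_1, g_2) : g_1, g_2 \in G\}\Big) = 1. \qquad (6)$$

We claim that for every $g_1, g_2 \in G$,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(Y_i^W(g_1, g_2)\big) = \mu(Y(g_1, g_2)). \qquad (7)$$

To see this, for any finite $W \subset G$ and $g_1, g_2 \in G$, let $Z(W; g_1, g_2)$ be the set of all $x \in X$ such that

$$g_1g_2 = \rho_W(\gamma_1\gamma_2, \psi^W(x)),$$
$$g_2 = \rho_W(\gamma_2, \psi^W(x)),$$
$$g_1 = \rho_W(\gamma_1, \psi^W(T_{g_2}x)).$$

Then

$$\lim_{i\to\infty} \eta_i \times u_{m_i}\big(Y_i^W(g_1, g_2)\big) = \mu(Z(W; g_1, g_2)).$$

This follows from Lemmas 6.4 and 5.1. Lemma 6.6 implies

$$0 = \lim_{W\to G} \mu\big(\{x \in X : g_1g_2 = \rho_W(\gamma_1\gamma_2, \psi^W(x))\} \,\Delta\, \{x \in X : g_1g_2 = \rho(\gamma_1\gamma_2, x)\}\big)$$
$$= \lim_{W\to G} \mu\big(\{x \in X : g_2 = \rho_W(\gamma_2, \psi^W(x))\} \,\Delta\, \{x \in X : g_2 = \rho(\gamma_2, x)\}\big)$$
$$= \lim_{W\to G} \mu\big(\{x \in X : g_1 = \rho_W(\gamma_1, \psi^W(T_{g_2}x))\} \,\Delta\, \{x \in X : g_1 = \rho(\gamma_1, T_{g_2}x)\}\big).$$

Therefore

$$\lim_{W\to G} \mu(Z(W; g_1, g_2)) = \mu(Y(g_1, g_2))$$

which implies the claim. Because

$$\mu\Big(\bigcup\{Y(g_1, g_2) : g_1, g_2 \in G\}\Big) = 1,$$

for any $\epsilon > 0$ there exists a finite set $G' \subset G$ such that

$$\mu\Big(\bigcup\{Y(g_1, g_2) : g_1, g_2 \in G'\}\Big) > 1 - \epsilon.$$

By the claim,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\Big(\bigcup\{Y_i^W(g_1, g_2) : g_1, g_2 \in G'\}\Big) > 1 - \epsilon.$$

Since this is true for every $\epsilon$,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\Big(\bigcup\{Y_i^W(g_1, g_2) : g_1, g_2 \in G\}\Big) = 1$$

which implies the first statement.

To see the second statement, observe

$$\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : \rho_W(\gamma_2, \xi[\sigma, W](p)) = g_2'\} \cap \bigcup\{Y_i^W(g_1, g_2) : g_1, g_2 \in G\} = \bigcup\{Y_i^W(g_1, g_2') : g_1 \in G\}.$$

Because $Y_i^W(g_1, g_2)$ is disjoint from $Y_i^W(g_1', g_2)$ if $g_1 \ne g_1'$, equations (6) and (7) imply

$$\liminf_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : \rho_W(\gamma_2, \xi[\sigma, W](p)) = g_2'\}\big)$$
$$= \mu\Big(\bigcup\{Y(g_1, g_2') : g_1 \in G\}\Big) = \mu(\{x \in X : \rho(\gamma_2, x) = g_2'\})$$

as required.

□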

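Lemma 6.9. There are maps $S_{W,\sigma,\xi}(\gamma) \in \mathrm{Sym}(m_i)$ (for every $W, \sigma, \xi, \gamma$) such that for every $\gamma \in \Gamma$,

$$\lim_{W\to G} \liminf_{i\to\infty} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : S'_{W,\sigma,\xi}(\gamma)(p) = S_{W,\sigma,\xi}(\gamma)(p)\}\big) = 1.$$

Proof. It follows from the previous lemma that for any $\gamma \in \Gamma$,

$$\lim_{W\to G} \liminf_{i\to\infty} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : S'_{W,\sigma,\xi}(\gamma)S'_{W,\sigma,\xi}(\gamma^{-1})(p) = p\}\big) = 1.$$

Therefore, $S'_{W,\sigma,\xi}(\gamma)$ is asymptotically surjective, in the sense that

$$\lim_{W\to G} \liminf_{i\to\infty} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : p \in \mathrm{image}\, S'_{W,\sigma,\xi}(\gamma)\}\big) = 1.$$

This implies the lemma. □

Let $\theta_{W,i}$ be the measure on $\mathrm{Sym}(m_i)^\Gamma \times B^{[m_i]}$ obtained by pushing $\eta_i$ forward under the map

$$(\sigma, \xi) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \mapsto (S_{W,\sigma,\xi}, \xi).$$

Let $\mathbf{Y}_{\beta,W,i}$ be the approximate process constructed from $\theta_{W,i}$ and $\kappa^\rho_{W,i}$ be the projection of $\theta_{W,i}$ to $\mathrm{Sym}(m_i)^\Gamma$.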

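Lemma 6.10. $\{\kappa^\rho_{W,i}\}_{i \in \mathbb{N}, W \subset G}$ is a random sofic approximation to $\Gamma$ in the following sense. For every $\gamma_1, \gamma_2 \in \Gamma$,

$$\lim_{(W,i)\to(G,\infty)} \kappa^\rho_{W,i} \times u_{m_i}\big(\{(\sigma, p) \in \mathrm{Sym}(m_i)^\Gamma \times [m_i] : \sigma(\gamma_1)\sigma(\gamma_2)p = \sigma(\gamma_1\gamma_2)p\}\big) = 1$$

and for every $\gamma_1 \ne \gamma_2 \in \Gamma$,

$$\lim_{(W,i)\to(G,\infty)} \kappa^\rho_{W,i} \times u_{m_i}\big(\{(\sigma, p) \in \mathrm{Sym}(m_i)^\Gamma \times [m_i] : \sigma(\gamma_1)p \ne \sigma(\gamma_2)p\}\big) = 1.$$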
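
The two conditions in Lemma 6.10 are the defining conditions of a sofic approximation, and for a single deterministic map $\sigma$ they can be checked by counting. The sketch below is a toy under assumptions not taken from the text: the group is $\mathbb{Z}$, points are indexed from 0, and `sigma` is a hypothetical perturbed cyclic shift chosen so that $g \mapsto \sigma(g)$ is not a homomorphism. Each $\sigma(g)$ is still a permutation, and the fraction of points where multiplicativity fails is at most $4/m$.

```python
from typing import List


def sigma(m: int, g: int) -> List[int]:
    """Hypothetical sigma(g) in Sym(m), stored as a list: sigma(g)[p] is the image of p.

    sigma(0) is the identity. For g != 0 it is the cyclic shift p -> p + g (mod m)
    followed by the transposition of 0 and 1, so g -> sigma(g) is not a homomorphism.
    """
    if g == 0:
        return list(range(m))
    swap = {0: 1, 1: 0}
    return [swap.get((p + g) % m, (p + g) % m) for p in range(m)]


def multiplicative_fraction(m: int, g: int, h: int) -> float:
    """Fraction of p in [m] with sigma(g) sigma(h) p == sigma(g + h) p."""
    sg, sh, sgh = sigma(m, g), sigma(m, h), sigma(m, g + h)
    return sum(1 for p in range(m) if sg[sh[p]] == sgh[p]) / m


def separation_fraction(m: int, g: int, h: int) -> float:
    """Fraction of p in [m] with sigma(g) p != sigma(h) p."""
    sg, sh = sigma(m, g), sigma(m, h)
    return sum(1 for p in range(m) if sg[p] != sh[p]) / m


for m in (10, 100, 1000):
    print(m, multiplicative_fraction(m, 2, 3), separation_fraction(m, 2, 3))
```

As $m$ grows both fractions tend to 1, which is the deterministic analogue of the two limits in the lemma.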

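Proof. For the first assertion, note that by definition of $\kappa^\rho_{W,i}$ and the previous lemma

$$\lim_{(W,i)\to(G,\infty)} \kappa^\rho_{W,i} \times u_{m_i}\big(\{(\sigma, p) \in \mathrm{Sym}(m_i)^\Gamma \times [m_i] : \sigma(\gamma_1)\sigma(\gamma_2)p = \sigma(\gamma_1\gamma_2)p\}\big)$$
$$= \lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : S_{W,\sigma,\xi}(\gamma_1)S_{W,\sigma,\xi}(\gamma_2)p = S_{W,\sigma,\xi}(\gamma_1\gamma_2)p\}\big)$$
$$= \lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : S'_{W,\sigma,\xi}(\gamma_1)S'_{W,\sigma,\xi}(\gamma_2)p = S'_{W,\sigma,\xi}(\gamma_1\gamma_2)p\}\big)$$
$$= 1.$$

The last equality follows from Lemma 6.8.

Because the first statement is true, to prove the second statement it suffices to show that for any $\gamma \ne e$,

$$\lim_{(W,i)\to(G,\infty)} \kappa^\rho_{W,i} \times u_{m_i}\big(\{(\sigma, p) \in \mathrm{Sym}(m_i)^\Gamma \times [m_i] : p \ne \sigma(\gamma)p\}\big) = 1.$$

Equivalently,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : p \ne S_{W,\sigma,\xi}(\gamma)p\}\big) = 1.$$

By Lemma 6.9 and the definition of $S'_{W,\sigma,\xi}$, it suffices to prove

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : p \ne \sigma(\rho_W(\gamma, \xi[\sigma, W](p)))p\}\big) = 1.$$

If $g \in G \setminus \{e\}$ then because $\mathbb{K}$ is a random sofic approximation to $G$,

$$\lim_{i\to\infty} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : p \ne \sigma(g)p\}\big) = 1.$$

Also, if $\gamma$ is not the identity element then by Lemma 6.8 and because the action of $\Gamma$ on $(X, \mu)$ is essentially free, for every $\epsilon > 0$ there is a finite set $V \subset G \setminus \{e\}$ such that

$$\liminf_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) \in \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] : \rho_W(\gamma, \xi[\sigma, W](p)) \in V\}\big) > 1 - \epsilon.$$

Because $\epsilon > 0$ is arbitrary this inequality and the previous equation imply the lemma.

□

Lemma 6.11. The following hold.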

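1. $\lim_{(W,i)\to(G,\infty)} \mathbf{Y}_{\beta,W,i} = \beta\circ\mathbf{Y}$.

2. If $\lim_{i\to\infty} \mathbf{X}_{\beta,i} = \beta\circ\mathbf{X}$ strongly then $\lim_{(W,i)\to(G,\infty)} \mathbf{Y}_{\beta,W,i} = \beta\circ\mathbf{Y}$ strongly in the sense that for all finite $V \subset \Gamma$ and $\epsilon > 0$,

$$\lim_{(W,i)\to(G,\infty)} \theta_{W,i}\big(\{(\sigma, \xi) \in \mathrm{Sym}(m_i)^\Gamma \times B^{[m_i]} : d_V((\sigma, \xi), \beta\circ\phi) < \epsilon\}\big) = 1.$$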

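Proof. The first statement is equivalent to stating that the $V$-local statistics of $\mathbf{Y}_{\beta,W,i}$ converges to the $V$-local statistics of $\beta\circ\mathbf{Y}$ for every finite $V \subset \Gamma$.
For $W \subset G$ finite define

$$L_{V,W,i} : \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] \to B^V$$

by

$$L_{V,W,i}(\sigma, \xi, p)(\gamma) := \xi(S_{W,\sigma,\xi}(\gamma)p).$$

By definition, the $V$-local statistics of $\mathbf{Y}_{\beta,W,i}$ is $(L_{V,W,i})_*(\eta_i \times u_{m_i})$.
Let $\omega_{W,i}$ be the $W$-local statistics of $\mathbf{X}_{\beta,i}$. To be precise, if

$$\chi_{W,i} : \mathrm{Sym}(m_i)^G \times B^{[m_i]} \times [m_i] \to B^W$$

is defined by

$$\chi_{W,i}(\sigma, \xi, p) := \xi[\sigma, W](p)$$

then $\omega_{W,i} = (\chi_{W,i})_*(\eta_i \times u_{m_i})$.

Fix $b_0 \in B$. Define $\Phi_{W,V} : B^W \to B^V$ by

$$\Phi_{W,V}(\xi)(\gamma) := \xi(\rho_W(\gamma, \xi))$$

if $\gamma \in V$ and $\rho_W(\gamma, \xi) \in W$. Set $\Phi_{W,V}(\xi)(\gamma) := b_0$ otherwise. We claim that $(\Phi_{W,V})_*\omega_{W,i}$ is asymptotic to $(L_{V,W,i})_*(\eta_i \times u_{m_i})$ in the sense that

$$\lim_{(W,i)\to(G,\infty)} \big\|(\Phi_{W,V})_*\omega_{W,i} - (L_{V,W,i})_*(\eta_i \times u_{m_i})\big\|_1 = 0. \qquad (8)$$

According to Lemma 6.9 and the definition of $S'_{W,\sigma,\xi}$, for all $\gamma \in V$,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) : S_{W,\sigma,\xi}(\gamma)p = \sigma(\rho_W(\gamma, \xi[\sigma, W](p)))p\}\big) = 1.$$

Because $\lim_{i\to\infty} \mathbf{X}_{\beta,i} = \beta\circ\mathbf{X}$, $\lim_{i\to\infty} \omega_{W,i} = \psi^W_*\mu$. So by Lemmas 6.4 and 6.6, for all $\gamma \in V$,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) : \rho_W(\gamma, \xi[\sigma, W](p)) \in W\}\big) = 1.$$

Therefore,

$$\lim_{(W,i)\to(G,\infty)} \eta_i \times u_{m_i}\big(\{(\sigma, \xi, p) : L_{V,W,i}(\sigma, \xi, p) = \Phi_{W,V}(\chi_{W,i}(\sigma, \xi, p))\}\big) = 1.$$

This implies (8).

By Lemmas 6.4 and 5.1,

$$\lim_{i\to\infty} \Phi_{W,V} \circ \mathbf{X}^W_{\beta,i} = (T, X, \mu, \Phi_{W,V} \circ \psi^W).$$

Because $\Phi_{W,V} \circ \psi^W$ converges to the map $x \mapsto (\psi(S_\gamma x))_{\gamma \in V}$ as $W \to G$,

$$\lim_{(W,i)\to(G,\infty)} \Phi_{W,V} \circ \mathbf{X}^W_{\beta,i} = \big(T, X, \mu, x \mapsto (\psi(S_\gamma x))_{\gamma \in V}\big).$$

By (8) this means that the $V$-local statistics of $\mathbf{Y}_{\beta,W,i}$ converge to the pushforward of $\mu$ under $x \mapsto (\psi(S_\gamma x))_{\gamma \in V}$, which is the $V$-local statistics of $\beta\circ\mathbf{Y}$. This proves the first statement of the lemma. The second statement is similar. □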

In the previous lemma we used a sequence $\{\mathbf{X}_{\beta,i}\}_{i=1}^\infty$ of approximate processes over $G$ converging to $\beta\circ\mathbf{X}$ to construct a sequence $\{\mathbf{Y}_{\beta,W,i}\}_{i \in \mathbb{N}, W \subset G}$ that converges to $\beta\circ\mathbf{Y}$ (there is a slight abuse of notation here since $W$ varies over all finite subsets of $G$ instead of all subsets; we will continue this abuse below). In the next lemma, a sequence $\{\mathbf{X}_i\}_{i=1}^\infty$ of approximate processes over $G$ such that $\beta\circ\mathbf{X}_i = \mathbf{X}_{\beta,i}$ is used to construct a new sequence $\{\mathbf{Y}_{W,i}\}_{i \in \mathbb{N}, W \subset G}$ of approximate processes over $\Gamma$ satisfying various properties.

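Lemma 6.12. Given a sequence $\{\mathbf{X}_i\}_{i=1}^\infty$ of approximate processes over $G$ such that $\beta\circ\mathbf{X}_i = \mathbf{X}_{\beta,i}$, there exists a collection $\{\mathbf{Y}_{W,i}\}_{i \in \mathbb{N}, W \subset G}$ of approximate processes over $\Gamma$ such that

1. $\beta\circ\mathbf{Y}_{W,i}$ is equivalent to $\mathbf{Y}_{\beta,W,i}$.

2. If $\lim_{i\to\infty} \mathbf{X}_i = \mathbf{X}$ then $\lim_{(W,i)\to(G,\infty)} \mathbf{Y}_{W,i} = \mathbf{Y}$.

3. If $\lim_{i\to\infty} \mathbf{X}_i = \mathbf{X}$ strongly then $\lim_{(W,i)\to(G,\infty)} \mathbf{Y}_{W,i} = \mathbf{Y}$ strongly.

4. $h(\mathbf{Y}_{W,i}) - h(\beta\circ\mathbf{Y}_{W,i}) \ge h(\mathbf{X}_i) - h(\beta\circ\mathbf{X}_i)$ for all $W, i$.

Item (3) means: for all finite $V \subset \Gamma$ and $\epsilon > 0$,

$$\lim_{(W,i)\to(G,\infty)} \tilde\theta_{W,i}\big(\{(\sigma, \xi) \in \mathrm{Sym}(m_i)^\Gamma \times A^{[m_i]} : d_V((\sigma, \xi), \phi) < \epsilon\}\big) = 1$$

where $\mathbf{Y}_{W,i}$ is constructed from measures $\tilde\theta_{W,i}$ on $\mathrm{Sym}(m_i)^\Gamma \times A^{[m_i]}$.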

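Proof. Let $\tilde\eta_i$ be the probability measure on $\mathrm{Sym}(m_i)^G \times A^{[m_i]}$ such that $\mathbf{X}_i$ is constructed from $\tilde\eta_i$. Let $\tilde\theta_{W,i}$ be obtained by pushing $\tilde\eta_i$ forward under the map $J = J_{W,i}$ defined by

$$J(\sigma, \xi) = (S_{W,\sigma,\beta\circ\xi}, \xi).$$

Let $\mathbf{Y}_{W,i}$ be the approximate process over $\Gamma$ constructed from $\tilde\theta_{W,i}$.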

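Since $\beta\circ\mathbf{X}_i = \mathbf{X}_{\beta,i}$, it follows that $\beta\circ\mathbf{Y}_{W,i}$ is equivalent to $\mathbf{Y}_{\beta,W,i}$. The proofs of statements (2) and (3) are similar to the proofs of (1) and (2) of Lemma 6.11.

To prove the last statement, fix $i$ and $W$. Let $\tilde\beta : \mathrm{Sym}(m_i)^G \times A^{[m_i]} \to \mathrm{Sym}(m_i)^G \times B^{[m_i]}$ be the map $\tilde\beta(\sigma, \xi) = (\sigma, \beta\circ\xi)$. Let $\tilde\eta_{i,\tilde\beta(\sigma,\xi)}$ be the fiber measure of $\tilde\eta_i$ over $\tilde\beta(\sigma, \xi)$. Thus

$$\tilde\eta_i = \int \tilde\eta_{i,\tilde\beta(\sigma,\xi)} \, d\tilde\eta_i(\sigma, \xi).$$

By abuse of notation, we also let $\tilde\beta$ denote the same map with $G$ replaced by $\Gamma$. We define the fiber measure $\tilde\theta_{W,i,\tilde\beta J(\sigma,\xi)}$ of $\tilde\theta_{W,i}$ over $\tilde\beta(J(\sigma, \xi))$ similarly. Thus

$$\tilde\theta_{W,i} = \int \tilde\theta_{W,i,\tilde\beta J(\sigma,\xi)} \, d\tilde\eta_i(\sigma, \xi).$$

Let $\alpha : \mathrm{Sym}(m_i)^G \times A^{[m_i]} \to A^{[m_i]}$ denote the projection map. By abuse of notation, we also let $\alpha$ denote the projection map from $\mathrm{Sym}(m_i)^\Gamma \times A^{[m_i]}$ to $A^{[m_i]}$. Let $\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}$ denote the fiber measure of $\tilde\eta_i$ over $\tilde\beta J(\sigma, \xi)$. Because $\alpha J = \alpha$ and $J_*\tilde\eta_i = \tilde\theta_{W,i}$,

$$\alpha_*(\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}) = \alpha_*\tilde\theta_{W,i,\tilde\beta J(\sigma,\xi)}, \quad \forall (\sigma, \xi) \in \mathrm{Sym}(m_i)^G \times A^{[m_i]}.$$

Since $\alpha$ is injective on the support of $\tilde\theta_{W,i,\tilde\beta J(\sigma,\xi)}$, it follows that

$$H(\tilde\theta_{W,i,\tilde\beta J(\sigma,\xi)}) = H(\alpha_*\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}).$$

We claim that

$$\tilde\eta_{i,\tilde\beta J(\sigma,\xi)} = \int \tilde\eta_{i,\tilde\beta(\sigma',\xi')} \, d\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}(\sigma', \xi').$$

To see this, suppose $X, Y, Z$ are any three Borel spaces, $\mu$ is a probability measure on $X$ and $\pi_1 : X \to Y$, $\pi_2 : Y \to Z$ are Borel maps. Then for $(\pi_2\pi_1)_*\mu$-a.e. $z \in Z$, $\mu_z = \int \mu_{\pi_1(x)} \, d\mu_z(x)$. This follows from the fact that if $\Sigma_Y, \Sigma_Z$ are the sigma-algebras on $X$ obtained from pulling back the sigma-algebras on $Y, Z$ respectively and $f$ is a bounded function on $X$ then $\mathbb{E}[f \mid \Sigma_Z] = \mathbb{E}[\mathbb{E}[f \mid \Sigma_Y] \mid \Sigma_Z]$. To see how this applies to the equation above, set $X = \mathrm{Sym}(m_i)^G \times A^{[m_i]}$, $Y = \mathrm{Sym}(m_i)^G \times B^{[m_i]}$ and $Z = \mathrm{Sym}(m_i)^\Gamma \times B^{[m_i]}$. Let $\pi_1 = \tilde\beta$ and choose $\pi_2$ so that $\pi_2\pi_1 = \tilde\beta J$.
Therefore,

$$\alpha_*\tilde\eta_{i,\tilde\beta J(\sigma,\xi)} = \int \alpha_*\tilde\eta_{i,\tilde\beta(\sigma',\xi')} \, d\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}(\sigma', \xi').$$

By concavity of entropy,

$$H(\alpha_*\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}) \ge \int H(\alpha_*\tilde\eta_{i,\tilde\beta(\sigma',\xi')}) \, d\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}(\sigma', \xi').$$

Because $\alpha$ is injective on the support of $\tilde\eta_{i,\tilde\beta(\sigma,\xi)}$, $H(\alpha_*\tilde\eta_{i,\tilde\beta(\sigma,\xi)}) = H(\tilde\eta_{i,\tilde\beta(\sigma,\xi)})$. Therefore,

$$H(\tilde\theta_{W,i,\tilde\beta J(\sigma,\xi)}) \ge \int H(\tilde\eta_{i,\tilde\beta(\sigma',\xi')}) \, d\tilde\eta_{i,\tilde\beta J(\sigma,\xi)}(\sigma', \xi').$$

Thus,

$$h(\mathbf{Y}_{W,i}) - h(\beta\circ\mathbf{Y}_{W,i}) \ge \frac{1}{m_i}\int H(\tilde\eta_{i,\tilde\beta(\sigma,\xi)}) \, d\tilde\eta_i(\sigma, \xi) = h(\mathbf{X}_i) - h(\beta\circ\mathbf{X}_i).$$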

□ 
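
The concavity step in the proof above is the standard fact that the Shannon entropy of a mixture is at least the mixture of the entropies. A minimal numerical sketch of that inequality follows; the two "fiber" distributions and the weights are hypothetical illustrative numbers, not quantities from the proof.

```python
import math
from typing import Dict, List


def shannon_entropy(dist: Dict[str, float]) -> float:
    """H(dist) = -sum p log p, with the convention 0 log 0 = 0 (natural logarithm)."""
    return -sum(p * math.log(p) for p in dist.values() if p > 0)


def mixture(weights: List[float], dists: List[Dict[str, float]]) -> Dict[str, float]:
    """Convex combination sum_k weights[k] * dists[k] of finitely supported measures."""
    mixed: Dict[str, float] = {}
    for w, d in zip(weights, dists):
        for key, p in d.items():
            mixed[key] = mixed.get(key, 0.0) + w * p
    return mixed


weights = [0.25, 0.75]                           # hypothetical mixing weights
fibers = [{"a": 1.0}, {"a": 0.5, "b": 0.5}]      # hypothetical fiber measures
mixed = mixture(weights, fibers)

entropy_of_mixture = shannon_entropy(mixed)
mixture_of_entropies = sum(w * shannon_entropy(d) for w, d in zip(weights, fibers))
print(entropy_of_mixture, mixture_of_entropies)
```

In the proof the finite sum is replaced by an integral against a fiber measure, but the inequality has the same direction.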

We can now prove Proposition 6.2, whose statement is:

Proposition 6.2. Let $G$ be an amenable group, $\mathbf{X} = (T, X, \mu, \phi)$ an essentially free $G$-process with finite range $A$ and $S : (X, \mu) \to (X, \mu)$ be an essentially free measure-preserving Borel automorphism with the same orbits as $T$ (i.e., for $\mu$-a.e. $x \in X$, $\{T_g x : g \in G\} = \{S^n x : n \in \mathbb{Z}\}$).

Let $\beta : A \to B$ be a map and suppose the orbit change from $T$ to $S$ is measurable with respect to both the $T$-invariant sub-sigma-algebra generated by $\psi := \beta\circ\phi$ and the $S$-invariant sub-sigma-algebra generated by $\psi$. Then for any random sofic approximation $\mathbb{K}$ to $G$,

$$\overline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X}) = \underline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X}) = h(S, \phi \mid \beta\circ\phi).$$



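Proof. Let $\Gamma = \mathbb{Z}$. Let $\{\mathbf{X}_i\}_{i=1}^\infty$ be a sequence of approximate processes adapted to $\mathbb{K}'$, a subsequence of $\mathbb{K}$, so that

1. $\lim_{i\to\infty} \mathbf{X}_i = \mathbf{X}$;

2. $\lim_{i\to\infty} h(\mathbf{X}_i) - h(\beta\circ\mathbf{X}_i) = \overline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X})$.

Let $\mathbf{X}_{\beta,i} = \beta\circ\mathbf{X}_i$. Let $\{\mathbf{Y}_{\beta,W,i}\}_{i \in \mathbb{N}, W \subset G}$ be constructed as in the paragraph before Lemma 6.10. Let $\{\mathbf{Y}_{W,i}\}_{i \in \mathbb{N}, W \subset G}$ be the collection of approximate processes given by Lemma 6.12.

A diagonalization argument implies that there exist increasing sequences $\{W_i\}_{i=1}^\infty$ and $\{k_i\}_{i=1}^\infty$ so that if $\mathbf{Y}_i := \mathbf{Y}_{W_i,k_i}$ then $\lim_{i\to\infty} \mathbf{Y}_i = \mathbf{Y} = (S, X, \mu, \phi)$. Moreover we may assume (using Lemma 6.10) that if $\mathbb{K}^\rho := \{\kappa^\rho_{W_i,k_i}\}_{i=1}^\infty$ then $\mathbb{K}^\rho$ is a random sofic approximation to $\mathbb{Z}$. So Lemma 6.12 implies

$$\overline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X}) = \lim_{i\to\infty} h(\mathbf{X}_i) - h(\beta\circ\mathbf{X}_i) \le \limsup_{i\to\infty} h(\mathbf{Y}_i) - h(\beta\circ\mathbf{Y}_i) \le \overline{h}(\mathbb{K}^\rho, \mathbf{Y} \mid \beta\circ\mathbf{Y}) = h(\mathbf{Y} \mid \beta\circ\mathbf{Y}).$$

The last equality follows from Proposition 5.8. Since $\overline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X}) \ge \underline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X})$ it now suffices to prove $\underline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X}) \ge h(\mathbf{Y} \mid \beta\circ\mathbf{Y})$.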

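Let integers $m_i$ be given so that $\mathbb{K}' = \{\kappa'_i\}_{i=1}^\infty$ where each $\kappa'_i$ is a probability measure on $\mathrm{Sym}(m_i)^G$. Let $\mathbb{L} = \{\lambda_i\}_{i=1}^\infty$ be a random sofic approximation to $\mathbb{Z}$ where each $\lambda_i$ is a probability measure on $\mathrm{Sym}(m_i)^{\mathbb{Z}}$. By Proposition 5.8 there exists a sequence $\{\mathbf{Y}'_i\}_{i \in \mathbb{N}}$ of approximate processes over $\mathbb{Z}$ adapted to $\mathbb{L}'$, a subsequence of $\mathbb{L}$, such that

1. $\lim_{i\to\infty} \mathbf{Y}'_i = \mathbf{Y}$ strongly;

2. $\lim_{i\to\infty} h(\mathbf{Y}'_i) - h(\beta\circ\mathbf{Y}'_i) = \underline{h}(\mathbb{L}, \mathbf{Y} \mid \beta\circ\mathbf{Y}) = h(\mathbf{Y} \mid \beta\circ\mathbf{Y})$.

Let $\{\mathbf{X}'_{\beta,V,i}\}_{i \in \mathbb{N}, V \subset \mathbb{Z}}$ be constructed as in the paragraph before Lemma 6.10 with the roles of $\mathbf{X}$ and $\mathbf{Y}$ swapped. This construction is possible because the orbit change is, by hypothesis, measurable with respect to the smallest sigma-algebra generated by $\psi$ and the $\mathbb{Z}$-action. Let $\{\mathbf{X}'_{V,i}\}_{i \in \mathbb{N}, V \subset \mathbb{Z}}$ be the collection of approximate processes given by Lemma 6.12. According to that lemma,

1. $\lim_{(V,i)\to(\mathbb{Z},\infty)} \mathbf{X}'_{V,i} = \mathbf{X}$ strongly.

2. $h(\mathbf{X}'_{V,i}) - h(\beta\circ\mathbf{X}'_{V,i}) \ge h(\mathbf{Y}'_i) - h(\beta\circ\mathbf{Y}'_i)$ for all $V, i$.

So

$$h(\mathbf{Y} \mid \beta\circ\mathbf{Y}) = \lim_{i\to\infty} h(\mathbf{Y}'_i) - h(\beta\circ\mathbf{Y}'_i) \le \liminf_{(V,i)\to(\mathbb{Z},\infty)} h(\mathbf{X}'_{V,i}) - h(\beta\circ\mathbf{X}'_{V,i}).$$

A diagonalization argument and Lemma 6.10 imply that there exist increasing sequences $\{V_i\}_{i=1}^\infty$ and $\{k_i\}_{i=1}^\infty$ so that if $\mathbf{X}'_i := \mathbf{X}'_{V_i,k_i}$ then $\{\mathbf{X}'_i\}_{i=1}^\infty$ is adapted to a random sofic approximation $\mathbb{K}''$ of $G$. So the inequalities above imply

$$h(\mathbf{Y} \mid \beta\circ\mathbf{Y}) \le \limsup_{i\to\infty} h(\mathbf{X}'_i) - h(\beta\circ\mathbf{X}'_i) \le \underline{h}(\mathbb{K}'', \mathbf{X} \mid \beta\circ\mathbf{X}).$$

Observe that $\mathbb{K}'' = \{\kappa''_i\}_{i=1}^\infty$ where each $\kappa''_i$ is a probability measure on $\mathrm{Sym}(n_i)^G$ and $\{n_i\}_{i=1}^\infty$ is a subsequence of $\{m_i\}_{i=1}^\infty$. It follows from Theorem 4.3 that $\mathbb{K}''$ is asymptotic to a subsequence of $\mathbb{K}'$. It follows from Theorem 5.5 that

$$\underline{h}(\mathbb{K}'', \mathbf{X} \mid \beta\circ\mathbf{X}) \le \underline{h}(\mathbb{K}', \mathbf{X} \mid \beta\circ\mathbf{X}) \le \underline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X}).$$

So $\underline{h}(\mathbb{K}, \mathbf{X} \mid \beta\circ\mathbf{X}) \ge h(\mathbf{Y} \mid \beta\circ\mathbf{Y})$ as required.

□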